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Preface 


During the past few years, several different collagenous proteins 
have been isolated from connective tissues, and the collagen family is 
now expanded from type I to type XI. In this volume, each article 
reviews one of the “new” collagens in turn and provides a valuable 
source of information for not only newcomers to the collagen field but 
also all scientists investigating the structure and function ofthe extra- 
cellular matrix. All ofthe articles are written by leading investigators 
directly involved with the appropriate collagen and therefore repre- 
sent the current “state of the art.” 

In the introductory article, Kühn provides a review of the fibril- 
forming collagens (types I, II, and IID) and traces the early work on the 
structure of these collagens to our present knowledge of the structure 
of the collagen genes. This article is followed by a detailed description 
of type IV (basement membrane) collagen by Glanville, who presents 
the evidence that this collagen forms a network of molecules and pub- 
lishes for the first time the complete amino acid sequence of the al(IV) 
chain. For type V collagen, Fessler and Fessler primarily address the 
complex issues of the biosynthesis and chain assembly of the three or 
perhaps four genetically different chains which form this collagen 
type. Timpl and Engel present the evidence that type VI collagen is 
assembled to form tetramers, which subsequently undergo end-to-end 
associations to form the microfibrils present in many connective 
tissues. The subsequent article by Burgeson shows that molecules of 
type VII collagen are assembled to form partially overlapping dimers 
that subsequently undergo lateral associations giving rise to the an- 
choring fibers of the dermis. Endothelial cells, and perhaps some other 
cell types, synthesize a unique collagen called type VIII, and Sage 
and Bornstein present the available information for the structure of 
this collagen. 

Cartilagenous tissues are now known to synthesize at least three 
collagens in addition to type II collagen, and these have been desig- 
nated type IX, type X, and type XI. The structure of type IX collagen is 
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described in the article by van der Rest and Mayne, and for type X 
collagen, Schmid and Linsenmayer show that this collagen is unique to 
hypertrophic cartilage. Type XI collagen was previously called 1a2a3a 
collagen, and Eyre and Wu now present the evidence that this collagen 
can be regarded as a new collagen type. Finally, Vasios et al. review 
the recombinant DNA techniques presently being used to investigate 
collagen structure and show that it is possible to recognize a new 
collagen type from a cDNA library. 

This volume of Biology of Extracellular Matrix: A Series is the first 
time that such a comprehensive review of the different collagen types 
has been presented, and it is hoped that further research will be stimu- 
lated by having all of this information easily available within a single 
volume. 


RICHARD MAYNE 
ROBERT E. BURGESON 


The Classical Collagens: 
Types I, II, and Ill 


Klaus Kühn 


Max-Planck-Institut für Biochemie, Abteilung Bindegewebsforschung, 
D-8033 Martinsried bei München, Federal Republic of Germany 


I. INTRODUCTION 


Modern research on collagen at the molecular level began in the 
1950s. At this time, Highberger et al. (1950) and Schmitt et al. (1953) 
characterized the collagen molecule in the electron microscope, Rama- 
chandran and Kartha (1954) and Rich and Crick (1955) developed 
models for the triple-helical structure of collagen, and finally Boedt- 
ker and Doty (1956) determined the physicochemical properties of the 
collagen molecule in solution. By comparing the cross-striation period 
of 70 nm along the collagen fibril with the length of a collagen mole- 
cule of 280 nm, Schmitt (1956) first developed the concept of a quarter 
staggered array of molecules in the native fibril. Further protein 
chemistry and biochemical investigations led to a detailed knowledge 
of collagen structure and eventually resulted in the discovery, by 
Miller and colleagues (Miller and Matukas, 1969; Miller et al., 1971), 
that the 70-nm-banded collagen fibrils observed in the electron micro- 
scope are formed by three different but homologous collagen types. 
These are type I collagen, the most abundant and, for a long time, the 
only one known; type II collagen, the main collagenous protein of 
cartilage; and type III collagen, which accompanies type I collagen in 
different ratios in almost all tissues. Kefalides (1971) observed that 
basement membranes also contain a specific collagen (type IV), and 
later, Kühn et al. (1981) and Timpl et al. (1981) showed that type IV 
collagen differs markedly in structure from the fiber forming col- 
lagens. Presently, we know that in addition to type IV collagen, which 
forms a nonfibrillar network, the structures of type VI, VII, IX, and X 
collagens are all different from each other and from the fibril-forming 
collagens (Table I) (for a review, see Martin et al., 1985). In recent 
years, abundant knowledge of the structure of these new collagens was 
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provided by biosynthetic approaches, protein chemical, electron micro- 
scopic, immunological, and molecular biological methods. All collagens 
have in common a triple-helical domain, which is combined differently 
with globular and nonhelical structural elements. They all are synthe- 
sized intracellularly as individual molecules or precursor molecules. 
After secretion into the extracellular space, they aggregate with one 
another into supramolecular structures, which subsequently may be- 
come stabilized by intermolecular covalent bonds. The different col- 
lagen types seem to have developed during evolution to provide the 
organism with a variety of supramolecular structures necessary to 
adapt connective tissue to its manyfold mechanical and physiological 
functions. 

This article will deal with the classical fiber-forming collagens: 
types I, II, and III. A chronological description will be made of the 
important discoveries during the course of collagen research in order 
to give an overview of how our knowledge of collagen has evolved in 
the last 30 years. 


ll. EARLY INVESTIGATIONS 


Collagen is the main constituent of skin, the raw material of leather 
and gelatin, and was first investigated by chemists involved in leather 
and gelatin production. They described the characteristic amino acid 
composition, the high glycine content, and the presence of hydroxy- 
proline and hydroxylysine. In addition, the swelling capacity, the ex- 
traordinary tensile strength of collagen fibrils, and the unusual re- 
sistance to proteolytic attack were all investigated (for a review, see 
Grassmann et al., 1951). The first models for the primary structure as 
well as for the arrangement of the polypeptides in the fibril were 
developed by Astbury in 1940. Based on the amino acid composition of 
collagen, he assumed that every third residue along the peptide chain 
is glycine and every ninth residue is hydroxyproline. The X-ray reflec- 
tion pattern of collagen fibrils, which showed a strong meridian reflec- 
tion of 2.89 Ä (Herzog and Gonnel, 1924) and an equatorial reflection 
which varied between 10.5 and 16 Ä depending on the degree of swell- 
ing of the fibers (Küntzel and Prakke, 1933), led to models with paral- 
lel oriented, fully extended peptide chains. 

Another characteristic property of collagen fibers was independent- 
ly discovered by Hall et al. (1942) and Wolpers (1943). In the electron 
microscope, the fibers showed a cross-striation period of 60-70 nm, 
similar to the identity period of 67 nm observed in low-angle X-ray 
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reflection patterns (Bear, 1944). Electron micrographs of collagen 
fibrils at higher resolution after staining with phosphotungstic acid 
(PTA), uranyl salts, and chromium complexes later showed a subdivi- 
sion of the identity period with a characteristic cross-striation pattern 
containing 13 dark bands (Nemetschek et al., 1955). Several theories 
were developed to explain this pattern, but only Bear’s (1952) came 
near the truth. He assumed that the dark bands consist of accumulat- 
ed polar amino acids with long side chains, able to bind heavy metal 
ions, and that the light interbands contain mainly apolar residues with 
short side chains. 


Ill. ELECTRON Microscopic INVESTIGATIONS 


A. The Collagen Molecule 


Three parallel developing branches of collagen research led finally 
to an understanding of collagen structure at the molecular level. Exact 
high-angle X-ray diffraction patterns of stretched tendons obtained by 
Cowan et al. (1953) and the exact determination of the coordinates of 
the peptide chain by Pauling et al. (1951), such as the atomic distances 
and the angles between the individual bonds, enabled Ramachandran 
and Kartha (1954) and Rich and Crick (1955) to develop a model of the 
triple helix. It was proposed that each polypeptide chain is arranged in 
a stretched polyproline helix whose mobility is restricted by the pro- 
line rings. The fact that every third position in the collagen chains is 
occupied by a glycine residue allows the arrangement of three chains 
around a common axis to the triple helix, which is stabilized by hydro- 
gen bonds between the chains perpendicular to the helix axis (re- 
viewed by Ramachandran and Ramakrishnan, 1976). 

Nageotte (1927) observed that acidic extracts of embryonic connec- 
tive tissue contained collagenous proteins which could be reaggregated 
to fibrillar structures. Such collagen extracts were reinvestigated by 
Schmitt and colleagues with the help of the electron microscope. They 
found that acid-soluble collagen will aggregate at neutral pH to fibrils 
with the typical cross striation of native fibrils (Schmitt et al., 1942). 
Depending on the experimental conditions used, two additional aggre- 
gates absent in tissues were observed. In the presence of highly nega- 
tively charged compounds, such as hyaluronic acid or chondroitin sul- 
fate, the collagen molecules do not form fibrils of the native type. 
Because of their interactions with the negatively charged compound, 
they are forced to aggregate to the so-called long spacing fibrils (FLS) 
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with an axial period of 180-250 nm and a centrosymmetrical cross- 
striation pattern (Schmitt et al., 1953). By dialyzing neutral phosphate 
extracts of connective tissue against acetic acid, another artificial ag- 
gregate, the segment long spacing (SLS) crystallites (Schmitt et al., 
1953) were observed. They were 250-280 nm long and, after staining 
with phosphotungstic acid, they depicted an asymmetric cross-stri- 
ation pattern (see Fig. 1b). Highly negatively charged substances, such 
as nucleic acid or nucleotides present in the phosphate extract, were 
responsible for the precipitation of collagen at acidic pH. ATP was 
found to be the best reagent for the formation of SLS crystallites from 
collagen in acetic acid solution. An important finding was that all 
three macromolecular forms, the native fibrils, FLS, and SLS aggre- 
gates, were interconvertible by dissolution and subsequent reaggrega- 
tion under certain conditions (Schmitt et al., 1953). Thus, it is possible 
for the three different forms to be assembled by the same molecule 
with a length of about 250-280 nm. 

These assumptions were confirmed by the physicochemical charac- 
terization of collagen in solution by Boedtker and Doty (1956). They 
also described the collagen molecule as a thin rod with a length of 280 
nm. Putting together the X-ray, electron microscopic, and physico- 
chemical investigations, it followed that the collagen molecule consists 
of three polypeptide chains each with a length of about 1000 amino 
acid residues. They are arranged in a triple-helical structure that gives 
the molecule both its rod-like form and its rigid properties. The diame- 
ter of the collagen molecule, 1.4 nm calculated from the triple-helical 
structure, was in agreement with the physicochemical investigations 
and the X-ray data. Finally, Hall and Slayter (1959) observed indi- 
vidual collagen molecules in the electron microscope using the shadow- 
ing technique. The molecules were visible as thin and flexible rods 
with a length of 280-300 nm (see Fig. 1a). 

Particularly helpful for further investigation of the molecular and 
macromolecular structure of collagen were the SLS crystallites, which 
turned out to be the electron microscopic image of the collagen mole- 
cule. Positive staining with PTA and uranyl acetate produced an 
asymmetric cross-striation pattern with 58 bands (Fig. 1b). Since PTA 
reacts with the positively charged side chains and uranyl cations react 
with negatively charged side chains of the polar amino acid residues 
(Kühn et al., 1958; Gebhardt and Kühn, 1963), it was suggested that 
the cross striation pattern of SLS crystallites reflects the distribution 
of these residues along the polypeptide chains. Therefore, SLS crystal- 
lites were used for an initial comparison of the primary structure of 
the different collagens before the actual amino acid sequence was 
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known. Initially, collagens from anımals of various phylogenetic 
stages were compared. Crystallites were prepared from pepsin-dis- 
solved collagen of sea anemone, liver fluke, and carp. Surprisingly, 
their cross-striation patterns were identical with those prepared from 
bovine collagen. This revealed a remarkable stability of the amino 
acid sequence during evolution, at least in as far as the distribution of 
the polar charged residues (Nordwig and Hayduk, 1969) was con- 
cerned. This notion was verified when the first continuous stretch of 
amino acid sequence of some length of the «1(I) chain became known 
(von der Mark etal., 1970). An excellent correlation between the amino 
acid sequence and the cross-striation pattern was observed. Each band 
could be attributed to a distinct cluster of polar amino acids. The 
correlation was so exact that, in some cases, individual arginine resi- 
dues were reflected by a single dark band in the cross-striation pattern 
(Fig. 1c). 


B. The Structure of the Native Fibril 


In 1956, Schmitt suggested that the 70-nm-long cross-striation peri- 
od of the native collagen fibril is caused by a parallel but quarter- 
staggered array of the 280-nm-long molecule, so that the band pattern 
of one period is caused by the superimposed position of four different 
quarters of the collagen molecule. This hypothesis was experimentally 
confirmed by Hodge and Schmitt (1960) and Kühn et al. (1960). By 
superimposing the cross-striation pattern of four quarter staggered 
SLS crystallites in a way in which the dark bands matched best, a 
composite cross-striation pattern corresponding to that obtained with 
native fibrils was achieved. Since the dark bands represent areas with 
polar charged amino acid residues, it was assumed that the aggrega- 
tion of the molecules is determined by the sequence of the charged 
residues with the help of electrostatic forces (summarized by Kühn, 
1982). 

In 1964, Petruska and Hodge found that this kind of construction 


Fic. 1. Electron microscopic image of the collagen type I molecule. (a) Individual 
molecules visualized by the rotary-shadowing method. The length of the molecule is 300 
nm. (b) Segment long spacing crystallites stained with phosphotungstic acid and uranyl 
acetate. The molecules are parallel with their ends in register. (c) Comparison of the 
amino acid sequence of the a1(I) and «2(I) chain (position 120-405) with the correspond- 
ing cross-striation pattern of SLS crystallites. There is an exact correlation between the 
distribution of the polar charged amino acids and the dark bands in the cross-striation 
pattern (from Fietzek and Kühn, 1976). 
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Fic. 2. The array of the collagen molecules in a native fibril. (a) Schematic represen- 
tation of the parallel, D staggered arrangement of the collagen molecules in the fibril. 
(b) Collagen fibril, negatively stained with phosphotungstic acid. The dark and light 
areas reflect loosely and densely packed hole (Ho) and overlap zones (Ov), respectively. 
(c) Collagen fibril, positively stained with phosphotungstic acid and uranyl acetate, 
depicting the characteristic cross-striation pattern with the identity period of 67 nm. 


did not reflect the exact arrangement of the end regions of the collagen 
molecules. In negatively stained fibrils, the 67-nm-long period is divid- 
ed into a dark and a light area reflecting loosely and densely packed 
regions, respectively. With more exact determination of the length of 
SLS crystallites after negative staining, they found that the collagen 
molecule is not 4 but 4.4 D units (D = the length of one cross-striation 
period of 67 nm) and that, in the dense area of the period, the mole- 
cules overlap with their ends by about 30 nm (Fig. 2). It is striking that 
these electron microscopic investigations led to an exact determination 
of the arrangement of the molecules in a native fibril without a de- 
tailed knowledge of the primary structure. A decade later these results 
were confirmed by calculation of the interaction patterns of the mole- 
cules using a three-dimensional analysis of the amino acid sequence 
(Trus and Piez, 1976; Hofmann et al., 1978) and new interpretation of 
X-ray diffraction patterns (Hulmes and Miller, 1979). 
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IV. PROTEIN CHEMICAL INVESTIGATION 


In the 1950s, investigation on the primary structure of collagen also 
began. However, the elucidation of the amino acid sequence of a pro- 
tein as large as collagen, with an M, of 360,000, appeared to be an 
almost unsolvable problem with the available protein chemical meth- 
ods. First attempts were made by Schroeder et al. (1953, 1954). Col- 
lagen was fragmented by partial alkaline and acidic hydrolysis, and 
tri- and tetrapeptides were isolated and sequenced. The results gave 
the first indications of both a regular distribution of glycine along the 
peptide chain and the presence of the sequence Gly-Pro-Hyp. These 
experiments were extended by Grassmann and co-workers who inves- 
tigated peptides obtained after tryptic digestion of denatured bovine 
collagen, and who observed a prevalence of the common sequence Gly- 
X-Y (for a review, see Grassmann et al., 1965). 

The basis for a systematic investigation of the amino acid sequence 
of collagen was provided by Piez and co-workers in 1963. It was shown 
that during denaturation the collagen molecules disintegrate into a 
complex mixture of monomeric « chains, di- and trimeric components, 
and higher fractions, which reflect the presence of polymeric collagen 
molecules in solution. In a now classical article, Piez et al. (1963) de- 
scribed the separation of these components by chromatography on car- 
boxymethylcellulose and showed that the collagen molecule consists of 
two al and one a2 chain. Further progress was achieved by Piez and 
colleagues after applying the cyanogen bromide (CNBr) cleavage 
method to the purified « chains, which splits polypeptides specifically 
at the methionine residues (Bornstein and Piez, 1966). This resulted in 
a clear mixture of relatively few fragments which, after separation, 
was used as starting material for a systematic sequence analysis of the 
al and the a2 chains (Summarized by Fietzek and Kühn, 1976). Initial 
sequence studies were carried out mainly on the small cyanogen bro- 
mide peptides of rat collagen with the manual Edman degradation 
procedure (Kang et al., 1967; Bornstein and Piez, 1966). These studies 
were accelerated by the introduction of the automated degradation 
procedure developed by Edman and Begg (1967). In 1972 the primary 
structure of an entire a1(I) chain was known (see Fietzek and Kühn, 
1976). The sequence data, partly from rat and partly from bovine 
collagen, revealed a peptide chain with a continuous triple-helical do- 
main of 1014 residues and nonhelical telopeptides at the N- and C- 
terminus of 16 and 25 residues in length, respectively (see below). 

The exact knowledge of the CNBr pattern of the al and the «2 chain 
provided a basis of information for the discovery of type II and III 
collagens by Miller and co-workers. During an investigation of ex- 
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tracts of collagen from lathyritic chick cartilage, Miller and Matukas 
(1969) found a much higher proportion of a1 to a2 than the expected 
ratio of 2:1. Cyanogen bromide cleavage of the isolated a1 chain from 
cartilage depicted a peptide map different from that of the al and a2 
chains investigated previously. This new, genetically distinct collagen 
type is now known to be the major collagenous component of hyaline 
cartilage. It was first called type II collagen by Miller, whereas the 
“old” collagen, the main constituent of skin, bone, and tendon, became 
type I collagen. The type II collagen molecule consists of three identi- 
cal chains which were designated as al(II) (Miller, 1971). The SLS 
crystallites of type II collagen showed a band pattern similar to that of 
type I collagen, indicating a strong sequence homology of both collagen 
types (Stark et al., 1972). A characteristic difference was found in the 
higher content of hydroxylysine residues (see Miller, 1976) and the 
higher rate of glycosylation of a1(ID. 

Type III collagen was found by Miller et al. (1971) in pepsin-dis- 
solved skin collagen. They observed an « chain which differed in its 
chromatographic behavior on carboxymethylcellulose as well as in its 
CNBr peptide map from a1(I), «2(D and a1 (ID). It turned out that three 
of these al(III) chains form the molecule of type III collagen which 
accompanies type I collagen in many connective tissues except tendon 
and bone (see Miller, 1976). Electron-microscopic investigation of the 
band pattern of type III collagen SLS crystallites indicated a clear 
homology to type I and type II collagen (Wiedemann et al., 1975). 


TABLE II 
STEPS OF POSTTRANSLATIONAL PROCESSING OF COLLAGEN 


Modification 


Biological importance 


Removal of the signal sequence 
4-Hydroxylation of proline 


3-Hydroxylation of proline 
Hydroxylation of lysine 


Glycosylation of hydroxylysine 

Chain association, disulfide bonding, 
and triple-helix formation 

Conversion of procollagen to collagen 


Self assembly of collagen molecules 
Cross-link formation 


Unknown 

Regulation of the stability of the tri- 
ple helix 

Unknown 

Stabilization of intermolecular cross- 
links and hydroxylysyl glycosyla- 
tion 

Regulation of fiber diameter? 

Formation of the triple-helical pro- 
collagen molecule 

Prerequisite of fibril formation and 
regulation of fibril formation 

Fibril formation 

Stabilization of fibrils 
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Amino acid sequence analysis of the newly discovered types II and 
III collagens was finally accelerated by applying recombinant DNA 
technology. The first amino acid sequences of collagen deduced from 
al(I)- and a2(I)-specific cDNA were published by Fuller and Boedtker 
(1981). Today, the covalent structure of all three fiber-forming col- 
lagens is elucidated. However, before the relationship of the amino 
acid sequence, higher structures, and function of collagen can be dis- 
cussed, it will be necessary to describe the posttranslational processing 
events which collagen undergoes on its way from the biosynthesis of 
the polypeptide chain to the formation of macromolecular structure in 
the extracellular space (Table ID). 


V. BIOCHEMICAL INVESTIGATIONS 


A. Hydroxylation of Proline and Hydroxylysine 
and Glycosylation of Hydroxylysine 


The first indications for a posttranslational modification of the ami- 
no acids in collagen was the occurrence of the specific amino acid 
residues hydroxyproline and hydroxylysine, and the observation of 
Stetten (1949) that radioactively labeled hydroxyproline cannot be in- 
corporated into collagen. These results were confirmed by Peterkofsky 
and Udenfriend (1963, 1965), who could not find specific amino 
acyladenylate synthetases for hydroxyproline and hydroxylysine. 
They were able to show that hydroxylation did not take place at the 
stage of the amino acyl complexes of the tRNA but after incorporation 
of the residues in the peptide chain. Meanwhile, it became clear that 
the hydroxylation of not only proline but also of lysine is catalyzed by 
two separate enzymes, the proline 4-hydroxylase and the lysyl 
hydroxylase (see Kivirikko and Myllylä, 1980, 1985; Majamaa et al., 
1985). Both are located on the inner side of the membrane of the rough 
endoplasmic reticulum. The prolyl as well as the lysyl hydroxylases 
require Fe?*, 2-oxoglutarate, O,, and ascorbate. They all decarboxy- 
late 2-oxoglutarate into succinate, one atom of the O, molecule being 
incorporated into succinate while the other is transferred as a hydrox- 
yl group to the amino acid residues. Both enzymes catalyze the oxida- 
tion ofresidues in position Y of the tripeptide unit Gly-X-Y. An excep- 
tion has been observed in the non-triple-helical telopeptides of the a 
chains, where lysine residues are hydroxylated without being present 
in a tripeptide region. These residues are particularly important for 
the formation of stabilized intermolecular cross-links. 

There is also a prolyl-3-hydroxylase which oxidizes proline residues 
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mainly in position X (see Cardinale and Udenfriend, 1974). In the 
fiber-forming collagens, this has been observed only once in the al(I) 
chain in the sequence Gly-3Hyp-4Hyp (Fietzek and Kühn, 1976). In 
invertebrate collagens, 3Hyp has been found more often (Adams, 
1978). 

The presence of D-galactose and D-glucose covalently linked to col- 
lagen was shown by Grassmann et al. (1957), Gross et al. (1958), and 
Kühn et al. (1959). In 1966, Butler and Cunningham demonstrated 
that the carbohydrate component of collagen is bound to the hydroxyl 
group of hydroxylysine in the form of a disaccharide. Its structure, a 
2-O-a-D-glucopyranosyl-O-B-D-galactopyranosylhydroxylysine was 
elucidated by Spiro (1967). The formation of this collagen-specific car- 
bohydrate group is catalyzed by two specific enzymes. First, galactose 
is transferred to a hydroxylysine residue by the enzyme galac- 
tosyltransferase. Then, a glucose is transferred to the galactosy] mono- 
saccharide by the enzyme galactosylhydroxylysyl glucosyltransferase 
(Spiro and Spiro, 1971a,b; Kivirikko and Myllyla, 1979). Both enzymes 
use as a donor the corresponding UDP-glycosides and require the pres- 
ence of a bivalent cation, such as manganese. 

All five posttranslational enzymes, the three hydroxylases as well as 
the two glycosyltransferases can only modify the collagen polypeptide 
chain before formation of the triple helix occurs. In this connection, it 
is striking that the 4-hydroxy] group of the hydroxyproline residues in 
position Y of the tripeptide Gly-X-Y has an important function in the 
stabilization of the triple-helical structure, and that the degree of hy- 
droxylation of proline appears to regulate triple-helix formation. A 
triple helix formed by nonhydroxylated « chains exhibits a transition 
temperature of about 15°C lower than the ¢,, of a triple helix of nor- 
mal, hydroxylated collagen (Berg and Prockop, 1973). This indicates 
that new or underhydroxylated collagen is not able to form a triple- 
helical molecule under physiological conditions at a temperature of 
37°C. When hydroxylation has reached a certain degree, then triple- 
helix formation takes place and stops the enzymes. It has been specu- 
lated that, in this way, the tm value of the collagen molecules are 
adjusted to the body temperature. 

Taking into account the important role of hydroxyproline, attempts 
have been made to interfere in fibrotic diseases with an uncontrolled 
overproduction of collagen, such as liver cirrhosis or lung fibrosis. One 
approach was to inhibit proline 4-hydroxylase with inhibitors specifi- 
cally designed for the active site of this enzyme (Hanauske-Abel and 
Gunzler, 1982). Another approach was to give experimental animals 
an excess of proline analogs, such as cis-4-fluoro-L-proline or 3,4-de- 
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hydroproline, which are incorporated in place of proline into the pep- 
tide chains during protein synthesis and which cannot be hydroxylated 
at position 4 (Uitto and Prockop, 1975). 

The role of the collagen-specific disaccharide units attached to the 
hydroxylysine residues of the triple-helical domain remains unclear. 
Their number varies from collagen to collagen. Type II collagen, for 
example, often forms fibrils of small diameter and contains more car- 
bohydrates than type I collagen, which preferentially builds thick 
fibrils (see Miller, 1976). A high number of the bulky disaccharide 
groups appears to hinder the formation of highly ordered fibrils with a 
large diameter. Type IV collagen, which forms a nonfibrillar network 
(Timpl et al., 1981), is characterized by a high content of hydroxylysine 
residues, which are all occupied by the galactosylglucosy] disaccharide 
(see Glanville, this volume). 


B. The Precursor Form 


For a long time, it remained unnoticed that the a chains of the fiber- 
forming fibrils are synthesized in a precursor form. First indications 
came from biosynthetic investigations of collagen in cultures of fibro- 
blast and rat calvaria (Layman et al., 1971; Bellamy and Bornstein, 
1971), when collagenous polypeptide chains with an M, of 120,000 
instead of 95,000 were observed. A similar precursor form was isolated 
from the skin of calves with the inherited disease dermatosparaxis by 
Lenaers et al. (1971). SLS crystallites from procollagen isolated from 
dermatosparactic calves and from the medium of fibroblast cultures 
revealed an additional extension peptide at the amino-terminus of the 
collagen molecule. The carboxy end of the molecule, however, did not 
appear to be changed (Stark et al., 1971). It turned out later that these 
dermatosparactic animals had a defect in the amino-terminal pro- 
collagen peptidase, a proteolytic enzyme involved in the removal of the 
amino-terminal propeptide during conversion of procollagen. Further 
biosynthetic studies using tendon fibroblasts from chick embryos re- 
vealed that this precursor form did not reflect the genuine procollagen 
molecule. Procedures which prevented the action of endogenous pro- 
teolytic enzymes by adding protease inhibitors and avoiding acidic pH 
during the isolation procedure resulted finally in the isolation of an 
intact precursor, a precursor which contained propeptides not only at 
the amino-terminal but also at the carboxy-terminal end (Hoffmann et 
al., 1976). 

The discovery of the two propeptides completed our knowledge of the 
structure of collagen. It was thus shown that for the construction of 
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the genuine procollagen molecule not only triple-helical areas but also 
globular domains and nonhelical sections are used. However, in the 
case of the fiber-forming collagens, the globular end regions are not 
deposited in the final fibrillar structure. They may be used as control 
elements important during the posttranslational development of col- 
lagen. However, in other collagen types, for instance in type IV and 
type VI collagens, such globular domains are used as integral parts of 
the final macromolecular structure of the collagen (Timpl et al., 1981; 
von der Mark et al., 1984). 

The bulky globular domains of the propeptides of the intact collagen 
molecule may function as a transport form and prevent fiber forma- 
tion inside the cell. An important posttranslational event is therefore 
the conversion of the precursor form to the active molecule. Only when 
the propeptides are split off extracellularly by two specific enzymes— 
the procollagen amino and carboxy proteinases—can the collagen mol- 
ecules aggregate in an ordered D staggered array to form native 
fibrils. In the skin of dermatosparactic calves or sheep, where the 
amino propeptide is not removed because of a defective enzyme, the pN 
molecules do not form intact fibrils but aggregate into distorted rib- 
bons which are not able to provide the skin with the required bio- 
mechanical stability. As will be discussed below, the timing of the 
cleavage of the two propeptides seems to be important for the regula- 
tion of fiber diameters. Also, other functions of the propeptides have 
been proposed. The amino propeptide appears to play a role in the 
feedback control of collagen biosynthesis (Wiestner et al., 1979), 
whereas the carboxy propeptide is involved in the alignment of three a 
chains to facilitate the formation of the triple helix (Fessler et al., 
1981) (see below). 


C. Formation of Cross-Links 


An important posttranslational modification, which occurs outside 
the cell after aggregation of the molecules, is the formation of inter- 
molecular cross-links responsible for the mechanical stability of fibrils 
and the fiber network. Investigation of this complex event at the mo- 
lecular level was initiated by the observation of Gross et al. (1960) that 
a marked increase in the amount of soluble collagen occurs when B- 
aminopropionitrile (ß-APN) is administered to growing animals. B- 
APN was found to be the active compound of the sweet pea, Lathyrus 
odoratus, which causes the connective tissue disorder osteolathyrism. 
Amino acid sequence analysis of rat collagen later revealed that, in 
lathyritic collagen, the single lysine or hydroxylysine residue in the 
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amino-terminal telopeptide, in contrast to the collagen from healthy 
animals, is not oxidized to the lysine-derived allysine. Therefore, the 
intramolecular cross-link which leads to the B} component linking 
two collagen chains cannot be formed (Bornstein et al., 1966; Bornstein 
and Piez, 1966; see also Piez, 1968). The same is true for the carboxy- 
terminal telopeptide which also contains an oxidizable lysine or hy- 
droxylysine residue (Rauterberg et al., 1972). This suggested that ß- 
APN inhibits the formation of aldehydes from lysyl residues and, thus, 
the formation of cross-links. 

Investigation of some of the cross-link compounds formed was facili- 
tated by reductive labeling with tritiated sodium borohydride (see 
Bailey et al., 1974; Tanzer, 1976). The major cross-links found in boro- 
hydride-reduced collagen were hydroxylysinonorleucine and di- 
hydroxylysinonorleucine, produced by the condensation of allysine 
with hydroxylysine or hydroxyallysine with hydroxylysine to form a 
Schiffs base. It is an important fact that the degree of hydroxylation 
of the one lysine residue in each telopeptide determines the stability of 
the cross-link. If an aldehyde derived from hydroxylysine is involved 
in cross-linking, the Schiffs base initially produced undergoes an 
Amadori rearrangement to a stable ketoimine form (Fig. 3). If a 
lysine-derived allysine is the partner, a rearrangement of the formed 
aldimine bond is not possible; the cross-link remains labile and can, for 
example, be opened again by acidic pH (Robins and Bailey, 1975). This 
fact explains the differences in solubility of collagen from different 
tissues in dilute organic acids. Skin collagen, with a relatively low 
degree of hydroxylation of lysine in the telopeptides, is more readily 
soluble than collagen from bone and cartilage, which contains more 
hydroxylysines in the telopeptides. In addition to the described di- 
valent cross-links, a number of tri-and tetravalent condensation prod- 
ucts have been isolated which also can be reduced by sodium borohy- 
drate (see Tanzer, 1976; Robins and Bailey, 1977). During maturation 
and aging of collagen, the reducible cross-links are gradually convert- 
ed into nonreducible compounds in a manner that is as yet not clear. 
The possibility of in vivo reduction has been excluded (see Eyre, 1980). 
Other stabilization mechanisms have been suggested more recently 
involving oxidation to yield a peptide bond (Bailey et al., 1977) or to 
form a multivalent cross-link of unknown nature (Light and Bailey, 
1979). 

From the D staggered array of the collagen molecules in the fibril, 
the allysine or hydroxyallysine residues in the amino- and carboxy- 
terminal telopeptides are in the neighborhood of four different areas 
of the triple helix, where they are able to find a lysine or hydrox- 
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Fic. 3. Cross-link formation between a hydroxyallysine and a hydroxylysine. The 
resulting dehydrodihydroxylysinonorleucine is subsequently stabilized by an Amadori 
rearrangement to a ketoimine compound. If an allysine is the reaction partner so that 
the hydroxyl group is missing, then the rearrangement cannot take place and the al- 
dimine bond can be opened again, e.g., by acidic pH (Robins and Bailey, 1973). 


ylysine residue as reaction partner to form an aldimine cross-link. 
Inspection of the amino acid sequence of type I, II, and III collagens 
revealed, for the amino-terminal allysine, that a reaction partner ex- 
isted only in the 4D staggered array, namely, hydroxylysine-930. The 
situation is similar with the carboxy terminal allysine. The 4D stag- 
gered arrangement provides hydroxylysine-87 as partner in the 1D 
position, and lysine-327 may be a putative partner too (see Fietzek and 
Kühn, 1976). In order to localize the cross-links, insoluble collagen was 
fragmented by cyanogen bromide or by proteolytic attack, and cross- 
linked peptides were isolated and characterized (summarized by 
Fietzek and Kühn, 1976; Light and Bailey, 1980). In accordance with 
the amino acid sequence of types I, II, and III collagen, only those 
peptides which can be formed between 4D staggered molecules were 
found. As will be discussed below, the two helical cross-link regions 
around the hydroxylysine residues 87 and 930 represent a special, 
evolutionarily stable amino acid sequence which may also act as an 
attachment site for the enzyme lysyl oxidase. 

The initial step for the formation of cross-links, the oxidation of 
lysine or hydroxylysine, is catalyzed by the copper-dependent lysyl 
oxidase (reviewed by Siegel, 1979). Siegel (1974) found that this en- 
zyme has no effect on dissolved collagen molecules. The prerequisite 
for its catalytic function is aggregated molecules. This observation led 
to the suggestion that lysyl oxidase does not bind to the molecule 
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which is oxidized but to distinct sites of the neighboring molecule (see 
below) and, therefore, that a fibrillar collagen is required for enzyme 
action. 


VI. THE COVALENT STRUCTURE OF THE PROCOLLAGENS 


In this paragraph those areas of the primary structure of pro- 
collagen chains, which determine and control the formation of the 
molecular and macromolecular structure and which provide a better 
understanding of the complex sites of posttranslational events and 
metabolism of collagen, will now be discussed. Figure 4 depicts sche- 
matically the covalent structure of the «al(IID chain. Because of the 
strong homology between the fiber-forming collagen types I, II, and 
III, it can be taken as representative for all a chains [«L(I), «2(I), 
al(ID, and «al(IID] (Hofmann et al., 1980). The triple-helical domains 
ofthe « chains were mainly sequenced at the protein level (summa- 
rized by Fietzek and Kühn, 1976, and Galloway, 1982). This’is also 
true for the amino propeptides of a1(I) and al(III), which could be 
extracted from the skin of dermatosparactic or embryonic calves, re- 
spectively (Hörlein et al., 1979; Brandt et al., 1984). The sequences of 
the carboxy propeptides of all a chains (Fuller and Boedtker, 1981; 
Yamada et al., 1983a; Sandell et al., 1984), as well as of the amino 
propeptides of «2(I) and a2(ID, which were available only in small 
amounts, have been deduced from the DNA sequences (Tate et al., 
1983; Kohno et al., 1984). The triple-helical domain, inclusive of the 
non-triple-helical short appendices at both ends, contains all the struc- 
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Fica. 4. Schematic representation of the prepro a1(III) chain of collagen. The amino 
propeptide, the amino telopeptide, and the triple-helical domain of bovine (Brandt et al., 
1984; Fietzek et al., 1979; Dewes et al., 1979a,b; Bentz et al., 1979; Lang et al., 1979; 
Allmann et al., 1979) as well as the triple helix of human (Seyer and Kang, 1977, 1978, 
1981; Seyer et al., 1980) collagens were sequenced at the protein level. The sequence of 
the signal peptide, carboxyl telopeptide, and the carboxyl propeptide were deduced from 
chick genomic DNA (Yamada et al., 1983a,b). The arrows indicate the cleavage sites of 
the amino and carboxyl procollagen peptidase. R, Amino acid residues. 
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tural information for the construction of a cross-linked fibril in the 
extracellular space. The amino- and the carboxy-terminal propeptides, 
which are cleaved off before or during fibrillogenesis, appear to be 
important as control elements for the posttranslational development 
of collagen, and possibly also for a feedback regulation of collagen 
biosynthesis (Fessler et al., 1981; Wiestner et al., 1979). 


A. The Triple-Helical Domain 


The sequence of the main and central part of the a chain is essential 
for the formation of the molecular and macromolecular structure of 
collagen. It has the ability to form a triple helix with two other a 
chains, which gives the molecule a rodlike shape, and subsequently 
each molecule has the capability to aggregate in a highly ordered 
fashion to form fibrils. The formation of a continuous triple helix is 
dependent on an uninterrupted tripeptide sequence Gly-X-Y, whose 
length varies between 1014 residues in al(I) and 1023 residues in 
al(IID (see Galloway, 1982). The presence of the imino acids proline 
and hydroxyproline is important for the stability of the triple helix. 
There are three types of tripeptides containing imino acids, which 
contribute to the stability of the helix in the following order: Gly-Pro- 
Hyp > Gly-Pro-Y and > Gly-X-Hyp (Segal, 1969). The rigidity and the 
flexibility of distinct areas along the molecule can thus be adjusted by 
varying amounts of proline and hydroxyproline. A typical example is 
the carboxy-terminal end of the triple-helical domain of the a1(D and 
a2(I) chain, where the helix is sealed by a 5-fold repeat of the triplet 
Gly-Pro-Hyp. Two other regions of the molecule, the collagenase cleav- 
age site and the carboxy-terminal helical cross-link area, both of 
which require a more relaxed and flexible structure (see below), do not 
contain tripeptide units with Pro or Hyp. 


1. FORMATION OF FIBRILS 


Early electron microscopic investigations showed that the molecules 
in the native fibril are staggered from one another by a distance of 60— 
70 nm (Petruska and Hodge, 1964). This specific arrangement is main- 
ly brought about by electrostatic forces determined by the sequence of 
the basic and acidic amino acids. Knowledge of the amino acid se- 
quence allowed a more accurate analysis of the intermolecular interac- 
tions. Statistical evaluation of the sequence of a1(1), «2(D, and al (III) 
revealed that the 234-amino-acid-residue-long D unit was the most 
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prominent repeat. It is formed mainly by polar charged and hydro- 
phobic residues and divides the « chains into four homologous parts, 
D1, D2, D3, and D4 (Hulmes et al., 1973; Hofmann et al., 1980). Three- 
dimensional calculations of the interaction between collagen I and 
collagen III molecules, respectively, indicated not only the highest pol- 
ar but also the highest hydrophobic contact when the molecules were 
shifted against each other by the distance of 234 amino acid residues 
(Hofmann et al., 1978). These results confirmed early electron micro- 
scopic investigations and revealed, in addition, an important role of 
the amino acid residues with hydrophobic character. The fact that the 
polar as well as the hydrophobic amino acid residues are accumulated 
in polar and hydrophobic areas, respectively, seems to be important for 
stabilization of the collagen fibril structure. The interaction with 
neighboring molecules allows the formation of ordered clusters of ions 
(Piez and Torchia, 1975) and of hydrophobic side chains, which results 
in a higher stability than is possible if only single residues interacted 
with one another. 

Beside the axial arrangement of molecules in the fibril, their lateral 
array in a fibril is also important. Here, no clear relationship between 
the amino acid sequence and lateral arrangement of the molecules can 
be found, which indicates that the lateral array is not as strongly 
determined by the amino acid sequence as is the axial stagger of the 
molecules. This is in agreement with experimental observations. Ac- 
cording to observations of the X-ray reflection pattern of tendon, the 
molecules seem to have a pseudohexagonal array (Hulmes and Miller, 
1979) which can be easily disturbed, for example, by the swelling and 
unswelling of tendon (Nemetschek, 1969) or by staining with PTA 
(Nemetschek and Hosemann, 1973; Nemetschek et al., 1979). The pos- 
sibility of varying the azimuthal orientation of the molecules in a 
fibril appears to be an important structural feature of collagen, pre- 
venting the formation of crystal-like, rigid, nonfunctional fibrillar 
structures, and perhaps facilitating fiber formation by interaction 
with other extracellular matrix proteins. 


2. THE Cross-LinK SITES 


In addition to the amino acid sequence, which regulates the forma- 
tion of higher structures of collagen, there are also specific regions 
along the triple helix, which are important for the stabilization of the 
collagen fibril by cross-linking, for the degradation of the fibrils by 
collagenase and for the interaction of collagen with cells and other 
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Fic. 5. (a) Schematic representation of the cross-link sites in the D/3-long end-to-end 
overlap region of 4D-staggered collagen molecules (see Kühn, 1982). Lysine (Lys) or 
hydroxylysine (Hyl) 5N and 16€ are the active sites of the N- and C-terminal nonhelical 
regions. They have to be oxidized before cross-link formation can occur. Hyl-87 and -930 
are the active sites of the triple-helical cross-link regions which serve as reaction part- 
ners for the lysine- or hydroxylysine-derived aldehydes. (b) The amino acid sequences in 
the vicinity of the hydroxylysine residues 87 and 930 are evolutionarily very stable, as is 
shown for the a-chains of the bovine collagens I and III. 


matrix constituents. Cross-linking sites are characterized by low pro- 
line and hydroxyproline content and the presence of the sequence Hyl- 
Gly-His-Arg, which appears to be important as an attachment site for 
the enzyme lysyl oxidase (Fig. 5). It is thought that, after ordered 
assembly of the collagen molecules, this enzyme attaches first to the 
helical cross-link region before it can oxidize the hydroxylysine in the 
telopeptide of a D staggered adjacent molecule. Another characteristic 
feature of this area is the relaxed triple helix, caused by the absence of 
proline and hydroxyproline residues. This is important for the packing 
of the molecules into fibrils. Because of the end-to-end overlap of the 
4D staggered molecules, a tight packing requires bending of the mole- 
cules right in the cross-link area, which comes to lie at the borders 
between the hole and the overlap zone (Hofmann et al., 1980). 


3. THE COLLAGENASE CLEAVAGE SITE 


The intact triple helix is resistant to proteolytic attack. Incorpora- 
tion of the molecules into fibrils and the subsequent cross-link forma- 
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tion result in a further stabilization of the molecules, so that the 
denaturation temperature of an individual molecule rises from about 
40 to more than 60°C. In the fibril the triple-helical molecule can be 
cleaved only by a specific enzyme, the mammalian collagenase, which 
cleaves the molecule at a distinct position between glycine 775 and 
isoleucine 776 into two pieces, TC“ and TCE (three-fourths and one- 
fourth of the length of the molecule, respectively) (Gross et al., 1974; 
Miller et al., 1976; Gross and Nagai, 1965). The cleavage site is located 
in an area with a relatively low content of imino acids, and the relaxed 
triple helix is thought to facilitate the collagenase action (for a review, 
see Gross, 1976). In highly cross-linked collagen, the molecules are so 
extensively stabilized that even collagenase can act only slowly. The 
breakdown of collagen fibrils seems to be accelerated by the presence 
of unspecific proteolytic enzymes which split the a chains at the non- 
helical telopeptide between the end of the triple helix and the hydrox- 
ylysine residue involved in cross-linking, resulting in a loosening of 
the fiber structure. Cooperation with other proteases is also necessary 
for further degradation of the triple-helical fragments PC“ and PCE 
after denaturation (reviewed by Weiss, 1976). The rate of collagenase 
attack on types I, II, and III collagen was found to be different. Thus, 
rheumatoid synovial collagenase splits types I and III collagen five 
times faster than type II collagen, although comparison of the amino 
acid sequence of the three collagen types around the cleavage site does 
not provide an explanation for this fact (Woolley et al., 1975). 


B. The Amino Propeptides 


At the amino-terminus of all a chains of types I, II, and III collagen, 
there is a 23-amino-acid-residue-long signal peptide which enables the 
nascent a chains to penetrate the membrane of the endoplasmic re- 
ticulum and is subsequently cleaved off (Fig. 6). The structure of the 
remaining amino propeptide differs from e chain to a chain. In al(D) 
and «1 (III) they consist of two main domains: a globular one 66 and 68 
residues in length stabilized by five disulfide bridges, and a triple- 
helical one of 48 and 39 residues. In the amino propeptide of «2(D the 
globular domain is missing, and the signal peptide is directly con- 
nected to the triple-helical domain. The same is true for the «1(II) 
chain. Its triple-helical domain is, however, 12 tripeptide units longer 
than that of the «1(III) chain (reviewed by Kühn, 1986). 

Two important functions are proposed for the globular amino-termi- 
nal part; (1) the control of fiber formation in the extracellular space, 
which may also involve the carboxy propeptide and will be described 
later, and (2) a function in the control of collagen biosynthesis. It has 
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Fic. 6. Schematic representation of the amino-terminal prepro region of the al(D, 
a2(D, ol (ID, and ail) chains. From al(I) and a2(1) not only the protein structure but 
also the gene structure is known. The subdivision of prepro area in exons is indicated. R, 
Amino acid residues; bp, base pairs present.in each exon. Arrows indicate the cleavage 
site for the signal peptide, and arrowheads the cleavage site for the amino propeptide. 
Data were taken from the following references: «1(I), Chu et al. (1984), Horlein et al. 
(1979); a2(1), Tate et al. (1983); a1 (ID, Yamada et al. (1983b), Brandt et al. (1984); a1 (ID, 
Kohno et al. (1984). 


been observed that the globular part of the amino propeptide specifi- 
cally inhibits collagen biosynthesis of fibroblasts in culture and also in 
cell-free translation systems (Wiestner et al., 1979; Paglia et al., 1979). 
These experimental results led to the proposal that the amino propep- 
tide may be involved in a feedback control of types I and III collagen 
biosynthesis. However, it remains unclear as to both how the extra- 
cellular amino propeptide is taken up again by the fibroblast and how 
it is transported to the polysomes where protein synthesis occurs. Since 
the al(II) chain does not contain a globular domain in the amino pro- 
peptide region, this putative feedback contro] discussed for types I and 
III collagen cannot occur for type II collagen. 


C. The Carboxy Propeptides 


The carboxy propeptides of all the four a chains [a1 (I), «2(D, al (ID, 
and al(III)] are 244 to 245 residues in length. They contain eight 
cysteine residues, of which six are involved in intrachain disulfide 
bridges, which stabilize the globular structure. Two cysteine residues 
form interchain bridges which are important for the stabilization of 
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the complex of three « chains. These interchain bridges are specifical- 
ly formed at the carboxy-terminus and are a prerequisite for the for- 
mation of a triple-helical molecule (Uitto and Prockop, 1973). In agree- 
ment with their similar function, the carboxy propeptide regions of the 
a chains are strongly homologous to one another (Yamada et al., 
1983a). However, there must be characteristic sequence regions in the 
propeptides which determine the type-specific interactions of the car- 
boxy propeptides and ensure, for instance, that two a1(D and one «2(1) 
chains regularly come together to form a type I collagen molecule and 
prevent formation between a1 (III) and a1(I) or @2(D chains. 


D. The Junction Regions 


The flexible junction regions which connect the amino and carboxy 
propeptides with the main triple helix are also important structural 
elements of collagen. They contain the cleavage site for the specific 
procollagen amino and carboxy proteinases as well as the nonhelical 
crosslink site (reviewed by Kühn, 1984). After cleavage of the junction 
regions, the telopeptides remain with the triple helix, providing the 
molecule with available oxidizable lysine or hydroxylysine residues 
essential for the formation of the intermolecular bonds. The telopep- 
tides of all a chains contain a hydroxylysine with the exception of 
«2(I). In the carboxy-terminal telopeptide of «2(D) a stretch of 11 amino 
acid residues is missing which includes the hydroxylysine residue, so 
that the carboxy end of the «2(D cannot be involved in cross-linking 
(see Yamada et al., 1983a). 


E. Control of Fiber Formation by Propeptides 


There is experimental evidence that the amino and carboxy propep- 
tides may play an important role in the control of fiber formation and 
fiber growth. With immunoelectron microscopy, Fleischmajer et al. 
(1981) have observed pN type I and III collagen molecules, which con- 
tain the amino propeptide on the surface of thin fibrils. Collagen type I 
fibrils with increasing diameter lack the amino propeptides, whereas 
collagen type III fibrils, which do not grow, keep the amino propeptide 
on their surface. These results, which were confirmed by biochemical 
investigations, led to the proposal that the amino propeptide specifi- 
cally prevents the formation of thick collagen type III fibrils. 

Fibrillogenesis experiments in vitro using pN and pC type I collagen, 
from which during fibrillogenesis the amino or the carboxy propep- 
tides were cleaved off by the specific procollagen amino and carboxy 
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Fic. 7. The cleavage site of the procollagen amino proteinase in the amino-terminal 
junction region of the a1(I) and the a1(III) chain. The enzyme splits the proline—glu- 
tamine bond (arrow). The phenylalanine residue three positions amino-terminal to the 
cleavage site seems to be important for the a1(I) chain-cleaving amino proteinase. Sub- 
stitution by tyrosine in a1(III) prevents cleavage (Morikawa et al., 1980). 


proteinases, supported the proposal that fiber formation is controlled 
by the propeptides. pN collagen itself formed thin cross-striated 
fibrils, and cleavage of the amino propeptide during fiber formation 
resulted in better structured and tighter packed fibrils of the same 
diameter. pC collagen is, because of its bulky carboxy propeptide, un- 
able to aggregate. After removing the carboxy propeptide by carboxy 
proteinase, highly ordered thick fibrils and fiber bundles were formed 
(Miyahara et al., 1984). In light of these experiments, the sequence of 
removal of the propeptides during posttranslational modifications 
seems to be important. The conclusions of biosynthetic studies are that 
type III collagen first loses the carboxy propeptide and later the amino 
peptide (Fessler et al., 1981), whereas in type I collagen, the amino 
propeptide is cleaved off before the carboxy propeptide (Fessler and 
Fessler, 1978). This strongly suggests that a controlling influence of 
the propeptides during fibrillogenesis also occurs in vivo. 

The different rates of cleavage of the amino propeptide in types I 
and III collagen can be explained by a difference of the amino acid 
sequences around the procollagen amino proteinase cleavage site Pro- 
Gln (Fig. 7). Using a variety of synthetic peptides as inhibitors for the 
type I collagen-specific amino proteinase, it was shown that a phe- 
nylalanine residue three positions amino-terminal to the cleavage site 
appears to be particularly important for the enzyme. In type III col- 
lagen, however, which contains the same cleavage site Pro-Gln but 
which cannot be cleaved by the type I collagen-specific enzyme, this 
phenylalanine residue is replaced by a tyrosine (Morikawa et al., 
1980). 


F. Interaction of Collagen with Cells 


For a long time, only the biomechanical properties of collagen and 
its function as a stabilizing scaffold of connective tissues for the orga- 
nism were of research interest. In the 1960s, it was realized that dif- 
ferentiation and behavior of cells during embryonic development are 
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not only controlled by humoral factors but also by interactions with 
constituents of the surrounding extracellular matrix (for a review, see 
Hay, 1984). Thus Hauschka and Konigsberg (1966) showed that a col- 
lagen substrate promotes the differentiation of chick embryo myo- 
blasts to myotubes. Another example is the observation of Meier and 
Hay (1974) that gels of type I or type II collagen enhance the differ- 
entiation of corneal epithelium to the same extent as found for frozen 
and killed lenses. 

Collagen can interact directly with cells via specific receptors at the 
cell surface (Mollenhauer and von der Mark, 1983), or the interaction 
can be mediated by other extracellular matrix components. The best 
investigated is the glycoprotein fibronectin which can serve as a medi- 
ator between collagen and the cell surface. Fibronectin has a particu- 
larly high affinity for denatured collagens (Jilek and Hörmann, 1978). 
Using cyanogen bromide peptides from a chains of types I, II, and III 
collagen, it was found that in all collagens the same region between 
residues 751 and 791 is responsible for collagen binding (Kleinman et 
al., 1976; Dessau et al., 1978). This region overlaps with the cleavage 
site of vertebrate collagenase, and it has been shown that the integrity 
of the collagenase-sensitive bond between residue 775 and 776 is re- 
quired for fibronectin binding (Kleinman et al., 1979). Fibronectin 
itself contains binding sites for collagen and for the cell surface (for 
reviews, see Yamada, 1983; Yamada et al., 1985). Important for the 
discussion of the physiological role of fibronectin as a mediator be- 
tween collagen and the cell is the fact that it has a higher affinity for 
denatured than for native collagen (Jilek and Hérmann, 1978). Hepa- 
rin, however, appears to stabilize the binding of native collagen to 
fibronectin (Jilek and Hérmann, 1979). 

Rubin et al. (1977) suggested that hepatocytes do not require fibro- 
nectin for binding to collagen. Hepatocytes attach equally well to all 
fiber-forming collagens, whereas native collagen was more efficient in 
promoting cell attachment than denatured collagen. Since all cyano- 
gen bromide peptides of type I collagen showed similar binding ac- 
tivities, it was suggested that cell binding sites in collagen have a 
fairly simple structure and occur in multiple copies along the « chains. 
This is in agreement with observations that collagen-like synthetic 
peptides also show a low but significant cell attachment effect (Rubin 
et al., 1984, 1986). 

Mollenhauer and von der Mark (1983) and Mollenhauer et al. (1984) 
have isolated a collagen-binding glycoprotein from chondrocyte mem- 
branes, anchorin CII, which, in vitro, preferentially reacts with native 
but not with denatured type II collagen. Treatment of type II collagen 
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with pronase, which removes the telopeptides on both ends of the mole- 
cules, abolished the reaction with anchorin CII, indicating that the 
binding sites for anchorin CII are located at the end regions of the 
collagen molecule (von der Mark et al., 1985). Since denatured type Il 
collagen does not react with anchorin CII, the binding site is conforma- 
tion dependent. A similar collagen receptor has, in the meantime, been 
isolated from bone and was designated as anchorin CI. It is similar to 
anchorin CII. Both proteins have the same M, of 34,000 and cross-react 
immunologically. 

Important for hemostatic events is the interaction of collagen with 
platelets. When subendothelial components of the vascular wall are 
exposed after loss of the endothelium, as may occur during atheroscle- 
rosis, platelets will adhere rapidly, undergo a release reaction, and 
subsequently aggregate to form a primary hemostatic plug. It has been 
shown that collagen is the most thrombogenic subendothelial compo- 
nent (Beachey et al., 1979). The collagen—platelet interaction has been 
investigated in an aggregometer. Whereas in vivo native cross-linked 
fibrils become exposed to platelets, in the in vitro experiments, pu- 
rified collagens were reacted with platelets as reformed fibrils of the 
native type, unspecific aggregates of native molecules, dissolved tri- 
ple-helical molecules, denatured « chains, and CNBr-derived peptides. 
Platelets can only be aggregated by aggregates of native molecules 
(Jaffe and Deykin, 1974; Balleisen et al., 1976). The activity of the 
fibrils seems to be independent of the array of molecules, either in a D 
staggered order or in an irregular manner. 

In contrast to aggregation, adhesion of platelets does not need col- 
lagen in an aggregated form but requires an intact triple-helical struc- 
ture (Balleisen et al., 1975a). If denatured collagen or CNBr peptides 
are used, the adhesion of platelets is drastically reduced, but there are 
some CNBr peptides (for example, the carboxy-terminal «1(I)CB6 and 
the al(IIDCB4 (position 403-551), which exhibit a small but signifi- 
cant adhesion activity (Fauvel et al., 1978a,b). If these in vitro experi- 
ments are relevant also for the in vivo situation, then in types I and III 
collagens different areas of the molecule may be responsible for 
platelet adhesion. In type I collagen it appears to be the carboxy-termi- 
nal telopeptide (Fauvel et al., 1978a) and in type III collagen a se- 
quence region (position 478-486) in the middle of the molecule (Fau- 
vel et al., 1979). The carboxy-terminal telopeptide of type I collagen 
does not seem, however, to be important for platelet aggregation. Pep- 
sin-treated type I collagen, where the carboxy-terminal telopeptide has 
been removed, as well as type I collagen, with intact telopeptides, 
exhibits in fibrillar form the same activity in aggregating platelets 
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(Balleisen et al., 1976). It has also been suggested that the first step of 
collagen-platelet interaction involves the carbohydrate moiety of col- 
lagen (Jamieson et al., 1971). The fact, however, that the collagen 
types with a high content of carbohydrates, such as types II and IV 
collagen, are less active than types I and III collagen, with a lower 
carbohydrate content, argues against an important role of these struc- 
tures (Balleisen et al., 1975b). 


G. The Antigenic Determinants of Collagen 


Antibodies against fiber-forming collagens and their precursors 
play an important role as a tool for the investigation of structure, 
biosynthesis, and metabolism of the collagens, their distribution in 
tissues, and any alterations which may occur in genetic and acquired 
connective tissue diseases. A short discussion of the relationship be- 
tween structure and antigenicity of the fiber-forming collagens fol- 
lows. At least five distinct groups of antigenic determinants have been 
recognized (for a review, see Furthmayr and Timpl, 1976) (Fig. 8). Two 
of these groups of determinants are specific for the precursor form and 
are localized in the globular domains at the ends of the molecule. 
Three further groups of determinants have been identified in the col- 
lagen molecule itself. The helical determinants are located in the 
triple-helical body of the molecule and are dependent on an intact 
triple-helical conformation. The terminal determinants, found at the 
telopeptides, are independent of an intact triple-helical conformation. 
Finally, unfolding of the triple helix exposes new determinants in the 
a chains which are called central determinants. Recognition of these 
groups of determinants during the antibody response depends on vari- 
ous parameters. Procollagens are better immunogens than collagens, 
and immunization with procollagen usually leads to production of pre- 
cursor-specific antibodies. It has also been observed that the choice of 
the animal species used for immunization may determine the recogni- 
tion sites. Rabbits, for example, produce high antibody levels to the 
nonconformational terminal determinants of collagen, while rats, 
guinea pigs, and mice show a preferential response to triple-helical 
structures (Furthmayr and Timpl, 1976). 

A precise characterization of the antigenic determinants was carried 
out for the terminal telopeptides. The method involved degradation of 
the respective CNBr peptides by proteases and comparison of the frag- 
ments in inhibition assays. For the amino-terminal region of «2(I) 
from rat and «1(]) from calf collagen as well as the carboxy-terminal 
region of calf collagen, a possible maximal size was found to involve 
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Fic. 8. Schematic representation of the antigenic determinants of procollagen and 
collagen. Determinants are located at the amino propeptide (1), the carboxy propeptide 
(2), the amino- and carboxy-terminal telopeptides (3), the triple-helical region (4), and 
the central part of the denatured « chains (5). 


five or six amino acids. Additional information could be obtained by 
comparison of the antigenic regions with the sequence of the non- 
cross-reacting collagen from rabbits, the species in which the anti- 
bodies had been raised. For each antigenic region, at least one sub- 
stitution was found on rabbit collagen. The difference either may be as 
small as a single substitution of leucine by methionine or may involve 
the whole region (Summarized by Timpl, 1976; Timpl et al., 1977). 


VII. THE GENE STRUCTURE OF COLLAGENS I, II, ano III 


Eukaryotic genes are characterized by a frequent interruption ofthe 
coding sequence by noncoding intervening sections, which results in a 
characteristic exon structure for each gene. Comparison of gene struc- 
tures with the respective protein structures revealed that exons often 
(but not always) code for structural or functional protein domains 
(Gilbert, 1978). Earlier, McLachlan (1976) proposed that there is a 
primordial gene unit, with the length of D, from which the collagen 
genes would have evolved by duplication. Closer investigation of the 
amino acid sequence of the triple-helical domain revealed a number of 
additional repeats (Table III) (Hofmann et al., 1980). The D/3 repeat, 
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TABLE III 
REPEATING UNITS OF THE AMINO ACID SEQUENCE OBSERVED IN THE TRIPLE-HELICAL 
DOMAIN OF THE al(I), @2(D, AND al (III) CHarnse 


D = 234 Residues 
Formed by polar-charged and hydrophobic (interactive) amino acids; con- 
trol of self-assembly, D-staggered array 
D/3 = 78 Residues 
Formed by noninteractive amino acids; length of triple-helical domain 4D 
+ D/3; hole zone 2/3D, overlap zone 1/3D 
D/6 = 39 Residues 
Formed by polar-charged amino acids (pitch?) 
D/11 = 21.3 Residues (2D/11 42.6 residues) 
Formed by hydrophobic amino acids (pitch?) 
D/13 = 18 Residues (2D/13 36 residues) 
Formed by interactive and noninteractive amino acids; primordial gene 
unit 


aFrom Hofmann et al. (1980). 


for example, has also been considered as a functional unit since it 
correlates with the length of the overlap regions of two 4D staggered 
molecules. In the fibril, therefore, each period of 67 nm is divided into 
a hole and an overlap zone 2 D/3 and 1 D/3 long, respectively. In 
contrast to the D repeat, which is formed mainly by the polar charged 
and hydrophobic amino acids (the interactive residues), the D/3 repeat 
is mainly formed by the group of noninteractive residues, particularly 
by prolines and alanines. Because the noninteractive residues have not 
been exposed to a specific selection pressure directed toward an im- 
proved aggregation of the molecules into the fibril, it was suggested 
that the original genetic unit from which the molecules developed by 
gene duplication may be better reflected by the D/3 rather than by the 
D repeat (Hofmann et al., 1980). No obvious function has been found 
for the shorter repeats D/6, D/11, and D/13 present in the triple- 
helical area. They were thought to determine the pitch of the triple 
helix (Hulmes et al., 1973). The fact that the pro « chains consist of 
three structural and functional domains, the triple helix and the two 
terminal procollagen peptides, suggests that the gene may also be 
composed of three sections of different exon structure, and the ques- 
tions arise of whether or not they have evolved differently and when 
during the course of evolution they were combined to one gene. 

The application of recombinant DNA technology resulted initially in 
the completion of the covalent structure of all four « chains. This is 
especially true for the carboxy-terminal propeptide regions, where 
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Fic. 9. Schematic representation of the approximately 18-kb-long human al(I) gene. 
Data are taken from Chu et al. (1984). The bars reflect the positions of the exons along 
the al(D gene determined by R-loop experiments. The length of the individual exons of 
the carboxy and amino propeptide regions are indicated. In the triple-helical domain, 
thin bars reflect exons with 54 nucleotides and thick bars reflect exons with 108 nu- 
cleotides. Exons with the length of 45, 99, and 162 bp are indicated. The length of the 
exons, particularly of the triple-helical domain and the carboxy propeptide region, are 
almost identical in all investigated genes of the fiber-forming collagens I, II, and III. The 
longest gene is «2(I) with approximately 40 kb (Boedtker et al., 1983). 


only few protein sequences were available (Fuller and Boedtker, 1981; 
Dickson et al., 1981; Yamada et al., 1983a; Sandell et al., 1984). The 
extreme amino-terminal part of the a chains, the signal peptides, and 
to some extent the amino propeptide would also not be known today 
without the DNA structure (Tate et al., 1983; Vogeli et al., 1981; 
Yamada et al., 1983b; Kohno et al., 1984). 

Finally, these investigations led to the gene structure of «2(I) 
(Boedtker et al., 1983) and al (III) (Yamada et al., 1984) of chicken, and 
a1(I) from human (Chu et al., 1984) (Fig. 9). The size of collagen genes 
varies from 39,000 bp in the case of a2(I) chicken (Boedtker et al., 
1983) to 18,000 bp for «l(I) human (Chu et al., 1984). The coding 
sequences, about 10-30% of the gene, are frequently interrupted. The 
number and size of exons are very similar in all genes of the fiber- 
forming collagens. The size of the introns, however, can vary toa much 
higher degree (for a review, see Boedtker et al., 1983). The comparison 
of the subdivision of the coding sequence in exons with the multifunc- 
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tional domain structure of collagen proteins led, on the one hand, to a 
better definition of the functional domains important for the forma- 
tion of the molecular and macromolecular structures. On the other 
hand there are periodic structures, mainly in the triple-helical domain, 
which do not easily relate to the exon composition of this area. 


A. The Triple-Helical Domain 


The coding sequence of the triple-helical domain of a1(I gene from 
human is divided into 41 exons, plus two exons at the end region that 
include in addition the flanking regions of the triple helix (Fig. 9). The 
basic exon unit of the triple-helical domain is 54 bp in length (Table 
IV). Exons with a length of 108 or 162 bp seem to have originated by 
the fusion of a 54-bp unit to form dimers and trimers. The occurrence 
of exons 45 and 99 bp long can be explained by the deletion of nine 
nucleotides corresponding to a tripeptide unit Gly-X-Y (Yamada et al., 
1980). 

Knowing the exon structure of the gene as well as the protein struc- 
ture, attempts can now be made to analyze how these two periodic 
structures correlate with each other. It is clear that the 18-amino-acid- 
residue-long D/13 repeat found in the protein structure reflects the 54- 
bp exon unit of the triple helix. There also appears to be a relationship 
between the D repeat and the D/13-long exon unit. Thus, the carboxy- 
terminal half of the triple-helical domain consists of a tandem repeat 
of 26 exon units whereby the borders of the exon units correlate exact- 
ly with the end of the D repeats (H. Hofmann and K. Kühn, un- 
published results). Molecules staggered against one another by integer 
multiples of an exon unit should show a preferred interaction between 
homologous regions. The optimal interaction between 26-exon unit- 


TABLE IV 
SIZES OF THE 41 Exons (5-45) OF THE 
TRIPLE-HELICAL DoMAIN“® 


Exon 
length (bp) Number of exons 
54 21 (1x, primordial gene unit) 
108 9 (2x) 
162 1 (3X) 
45 5 (1x, deletion of 9 bp) 
99 5 (2x, deletion of 9 bp) 


aSee Fig. 9. 
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long hypothetical molecules to fibrillar structures would occur by a 
stagger of 13 exon units, which in fact correlates with D. In this 
scheme, the fact that exon units with a distance of D exhibit the 
highest homology between one another also fits. The situation in the 
amino-terminal half of the triple-helical domain is, however, differ- 
ent. Because of a series of deletions of tripeptides, the exon units are 
out of phase with the D units in this area. But here it can be observed 
also that exon units, almost in phase with the D scheme, show the 
highest interaction with the neighboring molecules. 

It is likely that the triple-helical domain has evolved by repeated 
duplication of the exon unit of 54 bp. Once, by chance, a unit of 13 
exons may have duplicated as such, and thus the origin of the repeat- 
ing D unit was created. As this pattern proved favorable for an ordered 
assembly into fibrils, it was further strengthened by mutation and 
selective pressure as it can be observed in the amino terminal half of 
the triple-helical domain. Although the evolution of collagen cannot be 
reconstructed in detail by the comparison of the gene and the protein 
structure, it seems to be obvious that the D repeat of the triple helix 
has not evolved independently of the preexisting exon unit of 54 bp (H. 
Hofmann and K. Kihn, unpublished results). 


B. The Amino-Terminal Region 


The gene structure of this area is known for «2(I) from chick (Tate et 
al., 1983), a1(I) from human (Chu et al., 1984), and, in part, a1 (III) 
from chick (Yamada et al., 1983b) (Fig. 6). The subdivision of the DNA 
into coding exons agrees well with the structural elements at the pro- 
tein level. In the a1(I) chain, the amino-terminal signal peptide, the 
globular domain, and the short, flexible connection between the 
globular and the triple-helical portion are each encoded by one exon. 
The triple-helical portion consists of two exons of 36 and 102 bp whose 
length does not match the exon unit of 54 bp found in the main triple 
helix. The amino-terminal end of the «2(I) chain does not contain a 
globular domain. Here, the signal peptide is directly attached to the 
triple helix and is encoded by a 70-bp-long exon. The two following 
short exons, 11 and 18 bp in length, appear to be remnants of the 
globular part and its connection to the triple helix. This exon composi- 
tion suggests that the «2(I) chain originally possessed a globular do- 
main similar to «l(I) and that it was lost during the course of evo- 
lution. 
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Fic. 10. Scheme of the 283-nucleotide-long exon 4 of the al(III) gene encoding the 
carboxy junction area. Numbers give the position of amino acid residues in the area of 
exon 4. Arrow indicates the carboxy propeptidase cleavage site between arginine and 
aspartic acid. Cysteines in positions 18, 19, 82, and 88 are involved in interchain dis- 
ulfide bridges (Yamada et al., 1983a). R, Amino acid residues. 


C. The Junction Regions 


It seems likely that the periodic structure of the triple-helical do- 
main of the a chains evolved independently and according to different 
rules than the two globular propeptides. The triple helix, containing 
almost all information for the self-assembly of the molecules into 
fibrils, had to be completed by control elements, the two propeptides. 
The two regions used to connect the propeptides with the triple helix 
have gained important functions at the protein level, which is pre- 
served at the gene level using the two exons 46 and 4. Figure 10 
depicts, as an example, the carboxy-terminal function encoded by exon 
4, the longest continuous coding sequence of the collagen gene 
(Yamada et al., 1983a; Boedtker et al., 1983). In addition to the func- 
tions which can also be attributed to the amino-terminal junctions, 
such as to serve as a flexible connection, to provide the cleavage sites 
for the procollagen proteinases, and to contain a hydroxylysine residue 
involved in cross-linking, the carboxy-terminus is important for the 
formation of the triple helix initiated by a specific alignment of the 
carboxy propeptides of three « chains. Exon 4 includes the end part of 
the triple helix, the carboxy telopeptide, and the amino end region of 
the carboxy propeptide. The end region of the triple helix is charac- 
terized by a particularly high stability. In «1(I) and a2(I) it consists of 
five consecutive tripeptide units Gly-Pro-Hyp, and in type III collagen 
the very end of the helix is sealed by interchain disulfide bridges. The 
structure is thus adapted to a special function of this region, namely to 
facilitate the initiation of triple-helix formation at the carboxy-termi- 
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nus by stabilizing the short helical section first formed. The adjacent 
flexible connection to the carboxy propeptide of 38 amino acid residues 
exhibits a large number of amino acid substitutions between the a 
chains (Yamada et al., 1983a). It has been shown that, after aggrega- 
tion of three carboxy propeptides, triple-helix formation can only occur 
after the stabilization of the trimer by disulfide bridges (Uitto and 
Prockop, 1973; Fessler et al., 1981). Exon 4, which has been formed 
during the fusion process of two gene sections coding for the triple 
helical domain and the carboxy propeptide, respectively, is an excel- 
lent example of how a functionally important area is preserved by one 
exon although it is composed of three structurally different elements. 


VIII. Synopsis 


The three fiber-forming collagens, types I, II, and III, described in 
this article have, in principle, the same molecular structure. In their 
final processed form, more than 97% of the molecules consist of a 
continuous triple helical domain along which the main information for 
an ordered parallel and D staggered self-assembly is located. Thus, 
fibrils of the three collagens when reconstituted in vitro do not show 
striking differences. In vivo, however, the macromolecular structure of 
the three collagen types is variable. Comparative electron microscopy 
of tendon and cornea, both of which contain type I collagen, and 
hyaline cartilage, which consists mainly of type II collagen, revealed 
that the latter generally forms fibrils of smaller diameter than type I 
collagen. In addition, the often observed tendency of type I fibrils to 
assemble in parallel bundles is not found in type II collagen. Immu- 
noelectron microscopy of human skin with ferritin-labeled antibodies 
against types I and III collagen revealed thick fiber bundles of type I 
and thin fibrils of type III collagen. Although these electron micro- 
scopic investigations indicate that type I and type III collagens may 
form separate fibrils (Fleischmajer et al., 1981), there is chemical evi- 
dence that both collagens can be present in the same fibrils (Henkel 
and Glanville, 1982). 

Variations in the macromolecular structure of the three collagen 
types probably modify the biomechanical properties of the extra- 
cellular matrix to the various physiological functions of different con- 
nective tissues. However, these variations cannot be explained by the 
differences in the amino acid sequence of the triple-helical molecules. 
It is, however, likely that some of the posttranslational modification 
steps are responsible, at least to some extent, in controlling fiber diam- 
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eter. Thus, it is striking that the degree of glycosylation of type II 
collagen, which generally forms small fibrils, is higher than in type I 
collagen. Another possible influence on fibril assembly is by control- 
ling the time course in the removal of the amino and carboxy propep- 
tides (Miyahara et al., 1984). 

In this context, type V collagen, which also belongs to the family of 
the fiber forming collagens, and is described in detail by Fessler and 
Fessler (this volume), should be mentioned. The primary structure of 
the carboxy propeptide, as well as of the triple-helical domain of type 
V collagen, is homologous to that of the types I, II, and III collagen. 
Major differences exist in the amino propeptide region, which is pro- 
cessed in several steps and remains part of the molecule when incorpo- 
rated into a fibril (Broek et al., 1985). The macromolecular structure of 
type V collagen has not been fully established. However, it appears to 
form small fibrils and is often found in a pericellular arrangement, for 
instance in the vicinity of smooth muscle cells (Broek et al., 1985). 

During the last five years it has become clear that the ability of the 
fiber-forming collagens to vary their macromolecular organization is 
not enough to build stabilizing scaffolds for all of the different extra- 
cellular matrices. To introduce a higher variability, triple-helical ele- 
ments are often combined with globular domains on the one hand; on 
the other hand, the length and some structural features of the triple 
helical domain itself were often altered (see Table I). A typical exam- 
ple is type IV collagen, the collagenous component of all basement 
membranes (see Glanville, this volume). In order to adapt the mac- 
romolecular organization of this collagen to the sheetlike flexible 
structure of basement membranes, the 390-nm-long molecules of type 
IV collagen do not form fibrils but a network in which they are con- 
nected to one another via their like ends (Timp! ei al., 1981). To give 
the network the necessary flexibility, the triple helix was frequently 
interrupted by nonhelical areas. This reduces the stability of the triple 
helix in these areas and is compensated by an altered distribution of 
the proline residues between the positions X and Y in favor of position 
Y. Proline residues are only hydroxylated in position Y, and thus the 
triple helix is stabilized. Another important difference from the fiber- 
forming collagens was observed for the first time in type IV collagen. 
The C-terminal globular domain of the molecule is not removed but is 
used as an integral part of the final macromolecular structure of the 
molecule. Like most of the presently known collagens, type IV collagen 
contains the triple-helical domain as a major structural element. 
There is one exception, type VI collagen, where the triple-helical do- 
main represents only one-third of the molecule. It contains two large 
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terminal, globular domains which are also incorporated into the mac- 
romolecular structure of type VI collagen (see Timpl and Engel, this 
volume). 

Some of the newly discovered collagens (types IV, VI, and VII) fall 
into the classical definition of collagenous proteins. After biosynthesis 
of the triple-helical or partially triple-helical molecules, they are se- 
creted into the extracellular space where they aggregate to mac- 
romolecular structures which are subsequently stabilized by covalent, 
intermolecular bonds. Some of the new collagens, however, do not fall 
into this category. Type IX collagen, for example, does not appear to 
form its own macromolecular structure and is thought to aggregate 
with type U collagen and to influence the formation of the fibers of 
type II collagen in hyaline cartilage (Müller-Glauser et al., 1986). An- 
other new aspect of the collagen family is represented by type X col- 
lagen. It is synthesized only by chondrocytes from hypertrophic car- 
tilage (Gibson and Flint, 1985) and is thought to act as a factor in- 
volved in endochondrial calcification (see Schmid and Linsenmayer, 
this volume). Triple-helical segments are also found in other, less- 
characterized collagenous components (Sage, 1985; Yamauchi et al., 
1986) and in other proteins, such as the complement subcomponent 
Clq (Reid, 1979) acetylcholinesterase (Mays and Rosenberry, 1981) 
and conglutinin (Strang et al., 1986). For these latter proteins, the 
triple-helical elements are combined with globular domains which are 
functionally active. In addition, short collagenous sequences are pres- 
ent in other proteins such as the mannose-binding proteins of liver 
(Drickamer et al., 1986) and the pulmonary surfactant apoproteins 
(Floros et al., 1986). These results all indicate that the triple helix, the 
main structural element of the classical fiber-forming collagen, is 
more widely used than previously thought. 

The question arises whether all the triple helices derived from the 
different collagenous proteins have evolved from a common ancestor. 
The continuous triple-helical domain of the fiber-forming collagens is 
formed by duplication of a primordial gene unit of 54 bp (Yamada et 
al., 1980). Present information on the gene structure of other collagen 
genes, such as type IV collagen (Kurkinen et al., 1985; Soininen et al., 
1986; Sakurai et al., 1986), type IX collagen (Lozano et al., 1985), and 
also collagens from Drosophila (Monson et al., 1982) and nematodes 
(Kramer et al., 1982) revealed exon units which obviously do not fit 
into the 54-bp scheme. This would mean that there is no simple rela- 
tionship between the triple-helical domains of different collagens and 
that they may have evolved in different ways. The present knowledge 
of the gene structure of the individual collagens does not yet allow a 
more detailed discussion of their evolutionary development. 
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I. INTRODUCTION 


Type IV collagen is a highly specialized form of collagen found only 
in basement membranes. It is one of the major components of all base- 
ment membranes together with the glycoproteins laminin, nidogen, 
entactin, and heparan sulfate proteoglycan. 

Basement membranes are ubiquitous, thin, sheetlike structures 
found frequently under epithelial and endothelial cell linings but also 
surrounding many cell types such as muscle, nerve, and fat. They 
function as a selective filtration barrier for macromolecules, for exam- 
ple, in the kidney, blood—brain barrier, and placenta, but also separate 
extracellular matrix from epithelial or endothelial cell layers as in 
gut, skin, cornea, lung, and blood vessels. For more information about 
the structure and function of basement membranes and their non- 
collagenous components, the reader is referred to the following books 
and reviews (Kefalides, 1978; Robert et al., 1979; Kühn et al., 1982; 
Timpl and Martin, 1982; Timpl and Dziadek, 1986). 

Indications that basement membranes contained a collagen came 
from X-ray studies of intact basement membranes as early as 1951 
(Pirie, 1951). Later, hydroxyproline and then hydroxylysine were de- 
tected in amino acid compositions of whole basement membranes (Kef- 
alides and Winzler, 1966). Because of the insolubility of basement 
membrane components, attempts were made to solubilize the collagen 
using Pronase, a method that had proved useful for type I collagen 
(Kefalides, 1968). The material that was isolated and characterized 
was clearly different from the other interstitial collagens known at 
that time, i.e., «1(D, «1(ID), and «1(TID). Basement membrane collagen 
was therefore designated type IV collagen (Kefalides, 1973). 
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II. STRUCTURE 


A. Macromolecular Structure Using 
the Electron Microscope 


Because of the size of the collagen triple-helical molecule, it has 
been possible to use the electron microscope to investigate aggregates 
formed in vivo and in vitro to gain some insight into the macromolec- 
ular structure ofthe molecules. In the case ofthe interstitial collagens, 
observations on the banding patterns of native fibers, segment long 
spacing (SLS) crystallites (Kühn, 1982), and more recently rotary 
shadowed molecules, were instrumental in defining the detailed struc- 
ture of these molecules (Furthmayr and Madri, 1982). In the case of 
type IV collagen, in vivo observations of macromolecular structures 
formed in vivo have been, as yet, of limited value because of the amor- 
phous structure of basement membranes. 


1. SEGMENT LONG SPACING CRYSTALLITES 


Type IV collagen does not form well-ordered SLS crystallites. Exten- 
sive investigations on acid-extracted lens capsule material indicated 
that a globular noncollagenous domain was present (Schwartz and 
Veis, 1978, 1979). It was, however, incorrectly deduced that there was 
a globular domain at each end of the molecule. Even after mild pepsin 
digestion, SLS crystallites showing well-defined cross-striation ban- 
ding patterns were difficult to prepare. It was possible to say that the 
banding pattern, and thus the primary structure of type IV, differed 
from that of the interstitial collagens (Schwartz and Veis, 1978; Timpl 
et al., 1978; Glanville et al., 1979). Using trypsin-resistant fragments, 
crystallites were formed that were considerably longer (350 nm) than 
those of the interstitial collagens (290 nm), indicating for the first 
time that the helical domain of type IV was longer (Timpl et al., 1978). 


2. IMAGES or Rotary SHADOWED MOLECULES 


Applying the rotary shadowing technique to type IV collagen prepa- 
rations revealed how different the tertiary structure of this molecule 
was to all other well-characterized collagens. Initial studies were done 
on pepsin-solubilized material which indicated for the first time that 
the basic structural unit of basement membrane collagen is probably a 
tetramer (Kühn et al., 1981). This also showed that 7 S collagen, a 
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disulfide cross-linked collagenase-resistant fragment that had been 
originally isolated from the EHS mouse tumor (Timpl et al., 1979b) 
‚and later from other tissues (Risteli et al., 1980a; Dixit et al., 1981; 
Madri et al., 1983; Qian and Glanville, 1984) was, in fact, part of the 
type IV collagen molecule. 

As shown in Fig. 1A, the most intact form of type IV collagen pre- 
sent in a pepsin digest appeared to be four triple-helical molecules 
overlapping at one end which was later shown to be the amino-termi- 
nal end. Many of the molecules have one or more arms missing. The 
missing fragments were further degraded as evidenced by the pres- 
ence of a number of monomeric fragments of various lengths (Glan- 
ville et al., 1982). Increasing the temperature of pepsin digestion 
caused the removal of all four arms leaving the central overlap do- 
main, called the short form of the 7 S domain. Under slightly milder 
conditions an intermediate, carrying vestiges of all four arms, can be 
prepared and is referred to as the 7 S long form (Fig. 1B). Both forms 
of the 7 S domain can also be prepared, under appropriate conditions, 
using bacterial collagenase in place of pepsin (Fig. 1C). 

Careful measurements gave values of 356 + 10 nm for the lengths 
of the arms and 30 + 1 nm for the central overlap region, giving a total 
molecular length of 386 nm (Kühn et al., 1981). Subsequently, a value 
of 417 nm was reported which was considered to be a significant dif- 
ference in length (Bächinger et al., 1982b). It was suggested that the 
longer molecule may represent a type IV procollagen which carries an 
amino-terminal extension. This total length is considerably longer 
than the helix of the interstitial collagens, which is around 300 nm. 
The electron microscope pictures not only gave indications of the size 
and macromolecular organization of type IV molecules but also flexi- 
ble sites in the threadlike arms of the molecules could also be localized 
(Glanville et al., 1982). Most of the detected flexible sites allowed 
random bending about the mean 0°, but in one case, at the border of 
the 7 S domain of tetrameric type IV, a flexible site with a preferential 
angle of 40° was found (Hofmann et al., 1984). This is probably an 
important feature for the formation of higher aggregates of tetramers 
as discussed below. 

Having characterized the pepsin-solubilized type IV material, the 
next step was to investigate more intact material produced in cell 
culture or, in one case, by direct extraction of a mouse tumor with 
acetic acid (Timpl et al., 1981; Oberbäumer et al., 1982; Bächinger et 
al., 1982b). Examination ofthe rotary shadowed pictures of such intact 
molecules revealed a globular domain at the end of the molecule op- 
posite to the 7 S domain as shown in Figure 2D. Furthermore, dimeric 
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Fic. 2. Electron micrographs of intact type IV collagen molecules found in cell culture 
medium. (A) Tetramers from PF-HR9 cells (left), WISH cells (right). (B) Dimers from 
PF-HR9 cells formed by association of NC1 domains. (C) Dimers from PF-HR9 cells 
formed by association of amino-termini. (D) Monomers from PF-HR9 cells. Bar, 100 nm. 


Fic. 1. Electron micrographs of proteolytic fragments of type IV collagen prepared by 
the rotary shadowing technique. (A) Tetrameric type IV collagen prepared from pepsin 
digests of the EHS mouse tumor (left) and human placenta (right). (B) Long form of the 
7 S domain of human placenta type IV collagen from a pepsin digestion at 20°C. (C) 
Short form of the 7 S domain of EHS mouse tumor type IV collagen from a collagenase 
digest of pepsin-solubilized material at 37°C. (D) Hexameric globular domain NC1 of 
EHS mouse tumor type IV collagen from a collagenase digest of tissue at 37°C. Bars, 100 
nm. 
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molecules were seen joined via their globular domains (Fig. 2B) as well 
as the expected dimers and tetramers by association of the helical 
regions that form the 7 S domain (Fig. 2A and 2C). The dimer globular 
domain (Fig. 1D) can be isolated from bacterial collagenase digests of 
basement-membrane-rich tissues (see NC1 domain below). 

These electron microscope investigations, therefore, gave a good in- 
dication of the size and shape of the type IV collagen molecule as well 
as valuable information about the way in which molecules interact to 
form macromolecular structures. Both ends of the molecular appear to 
be cross-linking domains, and the arms function as flexible spacers. 

Two macromolecular organizations of type IV collagen molecules 
have been proposed, both compatible with the observations described 
above. The difference between the two possibilities seems to depend 
upon whether the tetrameric aggregate is formed first by self assem- 
bly of the 7 S domain or whether dimers are first formed by interaction 
of two globular domains (Fig. 3). The former results in a network 
model which involves only interaction of the ends of the molecules 
(Timpl et al., 1981) while the latter involves lateral interactions giving 
rise to a regular polygonal ‘structure (Yurchenco and Furthmayr, 
1984). 

In a cell culture system, the monomers (Fig. 2D) were observed to 
associate mainly into tetramers (Fig. 2A) by formation of the 7 S 
domain (Duncan et al., 1983). Only small amounts of material were in 
the form of dimers, formed by association of the carboxy-terminal 
(Fig. 2B) or amino-terminal (Fig. 2C). These data support the network 
model. The polygonal model was based upon rotary shadowing studies 
of the association products of preformed dimers, similar to those 
shown in Fig. 2B, that had been reduced under nondenaturing condi- 
tions. The physiological significance of these observations is, there- 
fore, in question. Nevertheless, a regular polygonal structure is sup- 
ported by direct observation of Descemet’s membrane (Sawada, 1982) 
and renal tubular basement membranes in which a hexagonal array 
could be seen (Carlson ei al., 1981). It was, however, not demonstrated 
that the observed structure was in fact type IV collagen. 

A third model, which is based upon the network model but incorpo- 
rates globular domain/helix interactions and possibly helix/helix in- 
teractions, was developed based upon assumptions about cross-linking 
(Bailey et al., 1984). The assumptions made were that the cross-linking 
site sequence Arg-His-Gly-Hyl, which has been conserved in in- 
terstitial collagens, is also present in type IV collagen and that these 
particular sites are at certain positions along the molecule, thus allow- 
ing formation of the proposed structure. However, these interstitial 
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Fic. 3. Schematic diagram illustrating two possible ways for monomer type IV col- 
lagen molecules to associate to form macromolecular structures. Formation of the 7 S 
domain through association of four molecules at their amino-termini gives rise to 
tetramers that can aggregate further into a regular tetragonal or irregular polygonal 
meshwork. Formation of dimers through association of two NC1 domains could lead to 
the formation of hexagons containing 12 molecules. These could aggregate further into a 
regular hexagonal or irregular polygonal network. The distinguishing features of these 
models are that the first possibility requires no overlap or lateral interaction of the 
triple-helical portion of the molecules whereas the second does. The “pore size” of the 
regular networks would also clearly be different. 


cross-linking sites are not present in type IV collagen and so the pro- 
posed structure remains a model without justification. 


B. Characterization of Proteolytic Fragments 


It has long been known that the collagenous component of basement 
membranes is insoluble under nondenaturing conditions. Original at- 
tempts to isolate basement membrane collagen were first made using 
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glomeruli which were either extracted after reduction and alkylation 
under denaturing conditions or treated with proteases such as Pronase 
and collagenase (Marquardt et al., 1973a,b; Hudson and Spiro, 1972; 
Sato and Spiro, 1976; Levine and Spiro, 1979). The former method 
produced small amounts of high-molecular-weight materials, while 
the latter produced complex mixtures of fragments. These investiga- 
tions indicated that type IV collagen contained cysteine, lysine that 
was extensively hydroxylated and glycosylated, and that type IV col- 
lagen was a homotrimer (Kefalides, 1973). A glycopeptide, carrying 
both asparagine-linked and hydroxylysine-linked carbohydrates, iso- 
lated from a collagenase/pronase digest of glomerular basement mem- 
brane (Levine and Spiro, 1979) was later localized to the 7 S domain of 
type IV (Glanville e¢ al., 1985). 

The first indications that type IV collagen did, in fact, contain two 
different a chains came in 1975 when Daniels and Chou used pepsin to 
solubilize isolated bovine glomeruli and characterized the pepsin frag- 
ments so produced. This observation was later substantiated by a 
number of groups working on renal glomerulus (Tryggvason and 
Kivirikko, 1978; Dixit, 1979, 1980; Trüeb et al., 1982), human placenta 
(Glanville et al., 1979; Bailey et al., 1979; Kresina and Miller, 1979; 
Sage et al., 1979; MacWright et al., 1983), lens capsule (Dixit, 1978; 
Gay and Miller, 1979), and chick muscle (Mayne and Zettergren, 
1980). 

An important step in the further characterization of type IV col- 
lagen was the discovery of a murine tumor that produced basement 
membrane components (Orkin et al., 1977). When mice carrying this 
tumor were made lathyritic, it was possible to extract a small amount 
of type IV collagen from the tumor using dilute acetic acid (Timpl et 
al., 1978). This material was later characterized using the rotary shad- 
owing technique and shown to be various aggregates of intact type IV 
collagen molecules (Timp! et al., 1981). Limited pepsin digestion of the 
tumor produced fragments P1 and P2 and, after bacterial collagenase 
digestion, the noncollagenous domains NC1 and NC2 (Timpl et al., 
1979a). These fragments were important in establishing the domain 
structure and primary structure of type IV collagen. 


1. HELICAL DOMAIN 


Pepsin digestion of various basement membrane-containing tissue 
preparations from various species produces different-sized pepsin 
fragments. This led to some confusion in the early literature as to the 
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Fic. 4. Schematic diagram showing the positions and names of the major pepsin 
fragments of the a1 and «2 chains of type IV collagen. V, Major pepsin cleavage sites; V, 
nontriplet sequences; —, regions of known amino acid sequence; ---, regions of unknown 
amino acid sequence. C-1, C, D-1, D, 50K1, 50K2, 15K2 fragments from bovine type IV 
(Dixit and Kang, 1979); F1, F2, F3A, F3B, F3C fragments from chick type IV (Mayne 
and Zettergren, 1980); P1, P2, P3A, P3B, P3C fragments from mouse type IV (Timp! et 
al., 1979a); «1(IV) 140, 95, 50, 7sL, 7sS and «2(IV) 120, 95, 70, 50, 7sL, 7sS fragments 
from human type IV (Glanville and Rauter, 1981; Glanville ei al., 1985). 


origin of these fragments and their relationship to one another. Figure 
4 is a schematic diagram of the type IV collagen « chains indicating 
the major structural features that have been described to date. Below 
each chain the position and name of the pepsin fragments that have 
been described are shown. Lens capsule, human placenta, and kidney 
appear to have similar fragmentation patterns, whereas mouse and 
chick are different. 
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The alignment of the pepsin fragments to either the al or «2 chain 
was possible, in most cases, by comparing their cyanogen bromide 
(CNBr) digest patterns on SDS electrophoresis gels. The initial assign- 
ment of one set of fragments to the al and the other to the a2 chain 
was arbitrary. CNBr peptides of the C-1 chain of bovine lens capsule 
(Dixit and Kang, 1979) and the porcine kidney D chain (Dixit and 
Kang, 1980) were isolated and characterized. The CNBr peptides of 
mouse P1 and P2 (Schuppan et al., 1982; Schwartz et al., 1986), human 
placenta (Babel and Glanville, 1984), and bovine lens capsule 50K 
(Schuppan et al., 1984) have been sequenced. The P1 regions of mouse 
and human al(IV) and part of the P2 region of mouse «2(IV) were also 
sequenced using cDNA sequencing methods (see Section III). 

Amino acid sequencing studies of various pepsin fragments derived 
from mouse and human type IV collagen soon revealed that the contin- 
uous triplet sequence (Gly-X-Y),, so characteristic of all other se- 
quenced collagens, was frequently interrupted in type IV collagen 
(Schuppan et al., 1980; Glanville and Rauter, 1981). In fact, in the 
known sequence of the a1 chain there are 21 interruptions and in the 
«2 chain, for which only half as much sequence data is available, 
about 10 (see Sections IX and X). It has been possible to accurately 
align the sequences of the mouse P1 and P2 fragments because they 
both terminate at the carboxy-terminal noncollagenous domain. This 
alignment revealed that the nontriplet interruptions did not always 
coincide in the helix and they were usually of different lengths 
(Schwartz et al., 1986). Presumably, the interruptions introduce flexi- 
bility into the arms of the type IV tetrameric structure as calculated 
from pictures of rotary shadowed molecules (see Section 11,A). The 
position and size of the interruptions has been conserved in different 
species as shown by comparing P1 regions of mouse, bovine, and 
human al type IV chains (Schuppan et al., 1984; Babel and Glanville, 
1984). It may be that at least some of these regions are binding sites 
for other basement membrane components (see Section VIII). 

The positions and the sizes of the triplet interruptions within one 
chain do not appear to follow any regular pattern, and their composi- 
tions show no common characteristics. In several interruptions, hy- 
droxyproline and hydroxylysine were found, indicating that the en- 
zymes responsible for the hydroxylation of these amino acids do not 
require a continuous triplet sequence to function. 

Comparing the Pl region of mouse al(IV) with the corresponding 
region of human al(IV) reveals a 90% identity of sequences (454 out of 
501 residues). Of the 47 differences in sequences, 40 were conser- 
vative, that is, no change in polarity. Four of the conservative sub- 
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Fic. 5. (A) A schematic representation of the carboxy-terminal triple-helical region of 
type IV collagen (P1/P2 region). Nontriplet interruptions are indicated and numbered 
with Roman numerals. The amino acid sequences of both al and a2 chains in these 
regions are shown. Bold letters depict nontriplet sequences. Gaps, indicated by a broken 
line, are introduced to allow alignment of triplet sequences. Pluses mark the position of 
hydroxyproline and hydroxylysine residues. Note that in sections II, V, VII, and IX the 
tripeptide structure is interrupted only in a2. (B) A schematic diagram of the 7 S domain 
of type IV collagen indicating the positions of cross-linking sites, the bend region, and a 
particularly stable helical region termed the cap site. The amino acid sequences of both 
al and a2 chains are shown for these regions. Cysteine residues and lysine/hydroxy- 
lysine residues that may be involved in cross-linking are shown in bold type. 


stitutions were in nontriplet sequences. This is in contrast to the in- 
terspecies homology in interstitial collagens where around 95% 
identity was found with only conservative substitutions being present. 

If, however, position and size of the triplet sequence interruptions of 
P1 and P2 are compared, no similarities can be seen (Fig. 5A). P1 
contains five interruptions whereas P2 has nine. Even in those posi- 
tions where both chains have interruptions, they are of different 
lengths, the longer one usually being in the a2 chain. 
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There is also no significant homology between the triplet sequence 
of Pl and P2. Whether this low homology is restricted to the C-termi- 
nal half of the a chains or extends throughout the triple-helical arm 
region is not known. However, a considerably higher homology exists 
between the al and a2 chains within the 7 S and NC1 domains (see 
below). This suggests different constraints on different regions within 
type IV collagen during the evolutionary divergence of the genes for 
the al(IV) and a2(IV) chains. 

The distribution of amino acids between the X and Y positions of the 
repeating triplet, Gly-X-Y, is typical of any triple-helical molecule 
(Babel and Glanville, 1984). The large hydrophobic amino acids, leu- 
cine, tyrosine, phenylalanine, and tryptophan, are located exclusively 
in the X positions. Glutamic acid, aspartic acid, and serine also have a 
strong preference for X, while the small neutral amino acids are fairly 
evenly distributed. One difference between the interstitial collagens 
and type IV is the strong preference of lysine and proline for the Y 
position in type IV, whereas in the interstitial collagens they are more 
evenly distributed. 

One small but interesting difference between human, bovine, and 
mouse type IV collagens is their content of 3-hydroxyproline. This 
amino acid is formed by a specific enzyme which hydroxylates proline 
in the amino acid sequence Gly-Pro-4-Hyp. Despite the frequent occur- 
rence of this sequence in all collagens, only one 3-hydroxyproline resi- 
due was found in sequences of interstitial collagens, in bovine al(D) 
CB6 (Fietzek et al., 1972). Compositional data indicate the presence of 
two residues in al(II) CB9,7 (Miller and Lunde, 1973). However, in 
both mouse and bovine P1 sequences there are two 3-hydroxyproline 
residues located in exactly the same positions. In human type IV the 
corresponding positions are not hydroxylated, and in mouse P2 these 
positions are not proline residues. The significance of these small 
amounts of this unusual amino acid remains unclear. 

Another unique feature of type IV collagen is the presence of com- 
plex carbohydrate. These are attached through asparagine residues 
located in the helical regions. One was found on each of the a chains of 
the 7 S domain and the other on the P2 region of mouse type IV 
(Glanville et al., 1985; Schwartz et al., 1986). In addition to these com- 
plex sugars, every lysine residue occupying a Y position in a Gly-X-Y 
triplet is hydroxylated and glycosylated. Again, this is in contrast to 
the interstitial collagens in which very few of the lysine residues are 
hydroxylated and only some of these carry carbohydrates. A function 
for the carbohydrates still must be determined. 
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The amino-terminal of both chains isolated from the 7 S domain are 
blocked, probably by a pyroglutamic acid residue as is the case for all 
other sequenced collagen chains (Qian and Glanville, 1984). An amino- 
terminal sequence of the al chain, isolated from a trypsin-treated 
short form of 7 S, has been determined (Glanville et al., 1985). The a1 
chain has a 15-residue-long nontriplet sequence at its N-terminus 
which includes four cysteines and two lysines (Fig. 5B). All of these 
are capable of forming inter- and intramolecular cross-links. This is 
followed by 39 tripeptide units that include one cysteine residue in the 
sequence Gly-Cys-Asn-Gly-Thr-Hyl-Gly-Glu-Arg (Pos. 97-105), which 
is able to participate not only in a disulfide bridge but also in a lysine- 
derived cross-link. As this cysteine is present in a triple-helical portion 
of the molecule, the SH groups must project radially outwards from 
the helix and can, therefore, only form intermolecular disulfide 
bridges. It presumably reacts with one of the four cysteine residues in 
the N-terminal nonhelical region of an antiparallel-aligned neighbor- 
ing molecule. It is not known which of the four cysteines react, but an 
arrangement in which the cysteine at position 14 is used would also 
allow for the formation of lysine-derived cross-links (positions 102 to 
11) and would lead to an optimum interaction between the hydro- 
phobic amino acids of two antiparallel-oriented type IV collagen mole- 
cules. How the second pair of molecules is oriented with respect to the 
first pair is unknown, and it is at present not clear how the disulfide 
bridges between the three remaining amino-terminal cysteine resi- 
dues are formed. The position and number of cysteine residues in the 
a2 chain are the same as those in the al, whereas lysine residues are 
fewer. 

Considering the 7 S domain as a whole, there are 12 chains involved, 
each containing four cysteine residues. Therefore, as no free SH group 
has been detected in intact type IV collagen, there are 24 disulfide 
bridges to be located. Both inter- and intramolecular disulfide bonds 
are present. In the tissue form of type IV collagen, extensive lysine- 
derived cross-linking is evident, but the typical cross-linking site se- 
quence, Arg-His-Gly-Hy], that is present in all interstitial collagens, is 
not present in type IV. In pepsin-solubilized human placenta 7 S do- 
main, only about 10% of the chains contained in the 7 S domain are not 
involved in lysine-derived cross-linking. Approximately equal propor- 
tions of dimers, trimers, tetramers, pentamers, and hexamers are pre- 
sent, all containing both «al and a2 chains. A small proportion are 
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involved in aggregates of 7-12 chains (Qian and Glanville, 1984). The 
structure of the cross-links have not been investigated in any great 
detail. The typical collagen difunctional cross-links that can be labeled 
using tritiated sodium borohydride are present in the 7 S domain 
(Bailey et al., 1984). No hydroxypyridinium trifunctional cross-links 
have been found (Heathcote et al., 1982), but there are indications of 
other, as yet uncharacterized, multifunctional cross-linking structures 
(Scott et al., 1981, 1983). The high concentration of cross-links within 
the 7 S domain explains both its resistance to trypsin, pepsin, and 
collagenase digestion and its high thermal stability reflected in the 
denaturation temperature of 70°C (Risteli et al., 1980a). 

At the junction of the 7 S domain and the arms of the type IV 
tetramer, the molecules have a preferred angle of 40° (Hofmann et al., 
1984). This coincides with a long triple-helical sequence interruption 
in both the o1 and «2 chains, but the interruption is one residue short- 
er in the «2 than in the al (Fig. 5B). A similar situation is found in the 
complement factor Clq which contains three different polypeptide 
chains, A, B, and C, in a triple helix. At one position the helix makes a 
bend of 60° (Reid and Porter, 1976). Sequence studies have shown that 
at this position the A chain contains an extra threonine residue insert- 
ed between two triplets, the B chain has a continuous triplet sequence, 
and the C chain has a glycine substituted by an alanine residue (Reid, 
1979). These interruptions of the triplet structure allow a bending of 
the molecule without disruption of the triple helix (Kilcherr et al., 
1985). However, as differences in the length of the adjacent nontriplet 
sequences in the «l and «2 chains of type IV occur at several places 
along the molecule where no preferred angle was detected, there must 
be other functions for these regions. 


3. NC1 DOMAIN 


The carboxy-terminal noncollagenous domain NC1 can be isolated 
from bacterial collagenase digests of basement membranes (Timpl et 
al., 1981; Dean et al., 1983; Weber et al., 1984; Butkowski et al., 1985; 
Oberbäumer et al., 1985). When tissues are used as starting materials, 
the product is in fact a cross-linked dimer of the globular domains 
from two type IV molecules (Fig. 1D). About 50% of the human a1(IV) 
NC1 has been sequenced at the protein level, and complete sequences 
of both mouse and human al(IV) NC1 have been determined via 
cDNA sequencing (Oberbäumer et al., 1985; Brinker et al., 1985a; 
Pihlajaniemi et al., 1985). Species homology is very high with only 
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seven conservative differences between the mouse and human se- 
quences, which contain 229 amino acids (Section IX, positions 908— 
1136). This domain contains a high proportion of hydrophobic amino 
acids and 12 cysteine residues. Analysis of the sequence for internal 
repeats indicated that it consists of two homologous regions, each con- 
taining six cysteine residues (positions 908-1018 and 1019-1136). 
When the sequences are aligned to give maximum homology, 35% of 
the positions are identical and a further 21% are conservative sub- 
stitutions. Therefore, the NC1 domain is probably formed from two 
spatially very similar substructures, each stabilized by disulfide 
bridges which may have arisen by duplication of a gene unit. 

The function of the NC1 domain may be twofold. In analogy to the 
carboxy-terminal propeptides of the interstitial procollagens, this do- 
main may be involved in chain selection and helix formation during 
the biosynthesis of type IV. Unlike interstitial procollagens, this do- 
main is not removed and clearly plays an important role in the forma- 
tion of the extracellular macromolecular structures. The NC1 domain 
of two molecules react to form a dimer which is covalently cross-linked 
by, as yet uncharacterized, nonreducible cross-links. According to one 
model of type IV superstructure, the NC1 domain can also interact 
specifically with certain regions of the triple-helical portion of the type 
IV molecule (see Section II,a). 


C. Chain Composition of Molecules 


Despite the vast amount of detailed structural data available for 
type IV collagen, it is still a matter of debate whether one molecule 
contains both «1 or a2 chains, i.e., (a1),02(IV), whether two molecules 
exist as a1,(IV) and a2,(IV) homotrimers, or whether a combination of 
both possibilities is correct. There is considerable evidence to support a 
heterotrimeric molecule. This is based upon chromatographic behavior 
(Trüeb et al., 1982; Mayne and Zettergren, 1980), the characterization 
of disulfide-bonded pepsin fragments (Fujiwara and Nagai, 1981; Di- 
eringer et al., 1985), cross-linked chains from the 7 S domain (Qian and 
Glanville, 1984), and studies with specific monoclonal antibodies 
(Odermatt et al., 1984; Mayne et al., 1984). Evidence for homotrimeric 
molecules originates from work on mouse tumor type IV pepsin frag- 
ments where (P1), and (P2), native fragments were characterized 
(Timpl et al., 1979a). Support for this concept was provided by pre- 
cipitation experiments in which the ratios of the al(IV) to a2(IV) 
chains varied. However, in retrospect this was probably not valid as 
the preparations contained cross-linked type IV tetramers. The ratios 
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of al to a2 would therefore depend upon the extent of involvement of 
these two chains in lysine-derived cross-links in the 7 S domain. More 
recent support for homotrimeric molecules is based upon cyanogen 
bromide peptide mapping and pepsin digestion of the chains produced 
by rat embryo-derived parietal yolk sac cells (Haralson et al., 1985). 
However, this was not conclusive evidence because the cyanogen bro- 
mide peptide patterns of the al(IV) and a2(IV) as visualized on SDS- 
PAGE are very similar (Robey and Martin, 1981). Also, characteriza- 
tion of pepsin-digested material may lead to erroneous conclusions, as 
it has been shown that the a2 chain is much more sensitive to pepsin 
digestion (Tryggvason et al., 1980; Alitalo et al., 1980; Robey and Mar- 
tin, 1981) and may have been present as small fragments in the sam- 
ple that was characterized. The existence of homotrimeric molecules 
therefore remains a matter for debate. 


Ill. MoLEcULAR BioLoGY 


Applying genetic engineering technology to investigate the gene 
structure and control of genes of type IV collagen is in its infancy. 


A. Genomic DNA 


Genomic clones from the mouse «a2(IV) collagen chain have been 
characterized and show some interesting differences in structure to 
the interstitial collagens gene structure (Kurkinen et al., 1985). Three 
exons of the gene were sequenced which contained 64, 123, and 182 
base pairs and coded triple-helical sequences from the amino-terminal 
region of P2 (Section X, positions 1-140). In the interstitial collagens, 
all exons are multiples of nine base pairs, and there is considerable 
evidence that the exons evolved by amplification of an ancestral 54- 
base pair exon unit (Yamada et al., 1980). In type IV this is clearly not 
the case. Another difference is that interstitial collagen gene exons 
start with an intact glycine codon. In type IV, three out of the four 
exons characterized did not start with an intact glycine codon. The 
third base usage for glycine and proline also differs. In interstitial 
collagens it is most frequently U or C, but in the characterized a2(IV) 
exons it was not. These results suggest that the a2(IV) structure of the 
collagen gene is different and may have evolved independently from 
the fibrillar collagen genes. 

Similar conclusions were drawn from studies of the human al(IV) 
collagen gene (Soininen et al., 1986). Here, seven exons of 134, 73, 72, 
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129, 99, 213, and 178 base pairs were characterized which coded for the 
carboxy-terminal region of the helical domain and about half of NC1 
(Section IX, positions 970-1269). In addition it was shown that the 
introns were consistently large, the smallest being 960 and the largest 
more than 3500 base pairs. 


B. cDNA 


cDNA libraries from mouse parietal endoderm, human fibroblasts 
(Brinker et al., 1985), human tumor cell line HT-1080 (Pihlajaniemi 
et al., 1985), mouse F9 cells (Wang and Gudas, 1983), mouse EHS 
tumor and F9 cells (Oberbaumer et al., 1985), and mouse parietal endo- 
derm cells (Kurkinen et al., 1983) have been screened, using either 
antibodies or synthetic probes, to isolate clones containing «l(IV) and 
«2(IV) chain fragments. To date, only the carboxy-terminal one-third 
(1.e., Pl or P2 region) of the triple-helical domain and the complete 
globular domain (NC1) of both a1 and «a2 chains have been isolated 
and sequenced (see Sections II,B,1 and II,B,3). 


IV. BIOSYNTHESIS 


The biosynthesis of type IV collagen has been studied in a large 
number of cells and tissues in culture (see Table 4 in Timpl and 
Dziadek, 1986). Early data summarized by Kefalides (1978) indicated 
the synthesis of a single polypeptide chain with a molecular weight in 
the range of 135,000—180,000 depending upon the system studied. 
More recent reports describe the synthesis of two polypeptide chains, 
al(IV), with an apparent molecular weight of about 180,000, and 
a2(IV), about 165,000, in cell culture (Alitalo et al., 1980; Crouch et al., 
1980; Tryggvason et al., 1980; Fessler and Fessler, 1982; Foidart et al., 
1981) and in cell-free synthesis (Kurkinen et al., 1982; Sawhney and 
Dixit, 1985). This is in good agreement with the size of material that 
can be directly extracted from mouse EHS tumor (Kleinman et al., 
1982; Karakashian et al., 1982), indicating that the biosynthetic prod- 
uct is not enzymatically processed to a smaller tissue form as is the 
case for the interstitial collagens. Pulse—chase experiments have also 
failed to detect processing of type IV collagen (Minor et al., 1976; 
Dehm and Kefalides, 1978; Crouch and Bornstein, 1979; Alitalo et al., 
1980; Heathcote et al., 1978, 1980). There is some evidence from mea- 
surements of rotary shadowed molecules for the removal of small poly- 
peptide extensions at both the amino-terminal (Bächinger et al., 
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1982b) and carboxy-terminal (Oberbäumer et al., 1982). However, a 
true type IV procollagen molecule has still to be described. 

Synthesis of a1(IV) chains occurs in about 10 min (Fessler and Fes- 
sler, 1982). Hydroxylation of proline and lysine and glycosylation of 
hydroxylysine residues is complete in about 40 min. The completed 
type IV molecule is secreted after about 100 min (Pihlajaniemi et al., 
1981), which is considerably longer than the interstitial collagens 
which are secreted in about 20 min. The long synthesis time may 
reflect a slow association of « chains as the carboxy-terminal NC1 
domains are not interchain disulfide bonded, slow folding of the triple 
helix because of the presence of many triplet sequence interruptions, 
or a long intracellular pathway. 

Secreted type IV molecules contain interchain disulfide bridges. In 
cell cultures in the presence of a,a-dipyridyl, which inhibits proline 
and lysine hydroxylation, disulfide bridge formation is inhibited and 
random-coiled chains are secreted (Crouch and Bornstein, 1979; Fes- 
sler and Fessler, 1982). In contrast, in a tissue culture of parietal yolk 
sacs, it was reported that secreted type IV collagen was disulfide bond- 
ed but was nevertheless a random coil, as judged by its susceptibility to 
proteolysis (Karakashian and Kefalides, 1982). 

The association of monomer type IV molecules into tetramers has 
been studied in a cell culture system in some detail (Duncan etal., 1983). 
Most of the secreted molecules remain as monomers or tetramers in cell 
culture medium, although a small fraction of dimers is present (Bäch- 
inger et al., 1982a; Oberbäumer et al., 1982). The monomers associate 
into tetramers, initially through noncovalent interactions (Duncan et 
al., 1983). This interaction is concentration dependent and reversible. 
Disulfide bonds are then formed between adjacent monomers, probably 
by disulfide exchange to form the 7 S domain. Dimer formation through 
interaction of the NC1 domains was rarely observed. In the cell layer, 
the tetramers are quickly incorporated into an insoluble matrix by 
formation of cross-links between NC1 domains and further lysine- 
derived cross-linking of the 7 S domain (Heathcote et al., 1978, 1980; 
von der Mark et al., 1985; Taylor and Grant, 1985) (see Section II,B,2). 

The actual conformation of the secreted collagen molecule is not 
known. In outgrowths from embryonic explants of mouse lung, type IV 
collagen can be deposited in the form of a fibrillar network (Chen and 
Little, 1985), in contrast to the amorphous deposits of epithelial cells 
in culture. It was suggested that, during embryogenesis, components 
of basement membranes and of connective tissue extracellular ma- 
trices may not be as highly segregated as in later development. Devel- 
opmental changes in basement membranes during embryogenesis 
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have also been demonstrated in the chick lens capsule (Fitch and Lin- 
senmayer, 1983; Fitch et al., 1983), as has heterogeneity within one 
membrane. Microfibrillar structures have been observed in some ma- 
ture basement membranes, for example the lung (Low, 1961), Re- 
ichert’s membrane (Inoue et al., 1983), and mouse EHS tumor (Inoue 
and Leblond, 1985), and in glomerular basement membrane (Farquhar 
et al., 1961; Laurie et al., 1984). It appears that the structure and 
composition of basement membranes may not be as constant as bio- 
chemical data suggest. Perhaps type IV collagen can assume different 
conformations as required, although the two models that have been 
suggested so far are not compatible with a fibrillar structure. 


V. IMMUNOCHEMISTRY 


A. Polyclonal Antibodies 


Antibodies to type IV collagen have been used extensively in the 
immunohistological analyses of basement membranes in normal and 
pathological tissues (see Table 3 in Timpl and Dziadek, 1986). Poly- 
clonal antisera have been raised against the 7 S domain (Risteli et al., 
1981; Madri and Williams, 1983; Dixit et al., 1984), NC1 (von der Mark 
et al., 1985), various pepsin fragments (Timpl et al., 1979c; Yaoita et 
al., 1978; Garbisa et al., 1981; Mayne et al., 1982) and extracted type IV 
from the EHS tumor (Timp! et al., 1978, 1979c; Madri and Furthmayr, 
1980; Chen and Little, 1985), and lens capsule (Brinker et al., 1985b). 
The 7 S and NC1 domains are strongly immunogenic, whereas the 
triple-helical domain is only weakly immunogenic. 

In addition to immunohistochemical studies, these antibodies have 
been used in radioimmunoassays to detect the concentration of anti- 
gen in body fluids and cell culture media (Bowman et al., 1980; Risteli 
et al., 1981, 1985; Brocks et al., 1985; von der Mark et al., 1985; Schup- 
pan et al., 1985) as well as for a number of structural studies. It was 
possible in one case to characterize antibodies specific for the a1(IV) 
and a2(IV) chains (Risteli et al., 1980b), again reflecting the dis- 
similarity of the two chains at the primary structural level as dis- 
cussed earlier (Section II,B,1). Cross reactivity of an antibody with a 
purified 7 S domain preparation and a helical pepsin fragment, iso- 
lated from bovine kidney cortex, was used to demonstrate that the 7 S 
domain is composed of the amino-terminal ends of the type IV collagen 
molecule (Dixit et al., 1983). Antibodies to type IV collagen were also 
instrumental in both understanding the pathophysiology of the tumor 
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cell invasive process and the recognition of its presence in tissue sec- 
tions (Barsky et al., 1983). 


B. Monoclonal Antibodies 


Monoclonal antibodies have been prepared against the mouse NC1 
domain (von der Mark et al., 1985), human type IV pepsin fragments 
(Sakai et al., 1982; Sundarraj and Wilson, 1982; Foellmer et al., 1983; 
Odermatt et al., 1984; Scheinman and Tsai, 1984), and chick type IV 
pepsin fragments (Fitch et al., 1982; Mayne et al., 1983, 1984). With 
one exception (see below), these antibodies are conformation specific. 
Mapping the antigenic determinants by rotary shadowing antigen 
antibody complexes (Fig. 6) revealed that there are three major deter- 
minants, one in the carboxy-terminal half of the triple helix (Mayne et 
al., 1984; Odermatt et al., 1984), one in the 7 S domain (Mayne et al., 
1984), and one approximately 55 nm from the 7 S domain (Foellmer et 
al., 1983; Mayne et al., 1984; Dieringer et al., 1985). Although the 
position of epitopes is similar in different species, the antibodies do not 
always cross react, indicating that the structures of the epitopes are 
different. 

Three monoclonal antibodies against chick type IV collagen were 
used in combination with rotary shadowing to align the native pepsin 


Fic. 6. Electron micrograph of a pepsin solubilized human type IV tetramer, with 
bound antibodies, prepared by the rotary-shadowing technique. The monoclonal anti- 
bodies, indicated by arrows, are bound to a site approximately 60 nm away from the 7 S 
domain. Bar, 100 nm. 
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fragments with the intact molecule (Mayne et al., 1984). Two mono- 
clonal antibodies, one against a native pepsin fragment of type IV 
collagen and one against the denatured «al(IV) chain, have been used 
to show that native type IV molecules are heterotrimers (Odermatt et 
al., 1984) (see Section II,B,3). Finally, a combination of peptide chem- 
istry and immunochemistry enabled an interchain disulfide bridge to 
be localized to a position close to the antigenic site (Dieringer et al., 
1985). 


C. Electron Microscopic Localization 


Immunoelectron microscopy has been used to demonstrate different 
compositions of the lamina rara and lamina densa. Type IV collagen 
was intially localized in glomerular basement membranes to either the 
lamina densa (Courtoy et al., 1982) or throughout the basement mem- 
brane (Roll et al., 1980; Mynderse et al., 1983). Subsequently, studies 
on other basement membranes demonstrated a concentration of type 
IV collagen in the lamina densa but with projections into the lamina 
rara (Laurie et al., 1982, 1984; Monaghan et al., 1983; Fleischmajer et 
al., 1985). These apparently conflicting results may be due to artifacts 
of fixation and antibody labeling, particularly when peroxidase is used 
(Courtoy et al., 1983). In fact, it may not be possible to localize mac- 
romolecules to a specific region of basement membranes because their 
extended structure spans the entire width of the membrane. The ap- 
parent localization would then depend upon the position of the epi- 
topes against which the antibodies are directed. There is also an appar- 
ent alteration in the distribution of type IV collagen during the 
development of diabetic microangiopathy in hyperglycemic rats (Ben- 
dayan, 1985). In normal animals it was localized to the lamina densa, 
but in diseased animals it was restricted to the subendothelial site of 
the lamina. 


VI. DEGRADATION 


Proteases that specifically degrade type IV collagen have been diffi- 
cult to characterize, presumably because of the presence of nontriplet 
sequences in the type IV collagen molecule which render it susceptible 
to attack by many nonspecific proteases. Therefore, the fact that a 
given enzyme preparation can solubilize type IV collagen is not a suffi- 
cient criteria to justify the designation type IV collagenase. For exam- 
ple, leukocyte elastase (Uitto et al., 1980; Mainardi et al., 1980) and 
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bone metalloproteinases (Murphy et al., 1981, 1985) both degrade not 
only type IV collagen but also laminin and other matrix proteins. 

A specific type IV collagenase has been isolated from cell cultures 
derived from a malignant tumor (Liotta et al., 1979) and the enzyme 
characterized (Salo et al., 1983). It is a neutral metalloproteinase of 
60,000-70,000 Da which is activated by trypsin. It has no action 
against the interstitial collagen types I, II, III, or V but cleaves both 
chains of type IV collagen at one specific locus. The fragments pro- 
duced have been characterized (Fessler et al., 1984) and the cleavage 
point located approximately 90 nm from the 7 S domain. This is not far 
from one of the antigenic determinants for monoclonal antibodies (Di- 
eringer et al., 1985; Fitch et al., 1982), an interchain disulfide bridge 
(Dieringer et al., 1985), and is a region of high flexibility (Hofmann et 
al., 1984). As this region contains both triplet and nontriplet sequences 
(unpublished data), it is not known whether the enzyme cleaves the 
three chains in a triple-helical or nonhelical region of the molecule. 


VII. INVOLVEMENT IN DISEASE PROCESSES 


Basement membranes are involved in a large variety of diseases 
either as the site of the primary lesion (as in autoimmune diseases 
such as Goodpasture syndrome, epidermolysis bullosa, bullous 
pemphigold), or in secondary complications (as in diabetes). Basement 
membranes are also degraded when cells, particularly tumor cells, 
invade tissues (for a review, see Liotta et al., 1983; Martinez-Her- 
nandez and Amenta, 1983; Timp] and Dziadek, 1986). However, a spe- 
cific involvement of type IV collagen in these pathological conditions 
has, in most instances, not been shown. 

Only in one pathological condition, Goodpasture syndrome, is there 
good evidence for a direct involvement of type IV collagen. Good- 
pasture syndrome, a rare form of glomerulonephritis, is an autoim- 
mune disease in which the antigen resides in the basement membrane 
on a collagenase-resistant component (Marquardt et al., 1973b; Fish et 
al., 1984; Yoshioka et al., 1985a,b; Wieslander et al., 1984b), later 
shown to originate from the noncollagenous NC1 domain of type IV 
(Wieslander et al., 1984a,b; Butkowski et al., 1985). The epitope is 
concealed inside the hexameric structure of the NC1 domain in normal 
tissue (Wieslander et al., 1985). It has also been possible to cause a 
Goodpasture-like syndrome by injecting mice with NC1 domain (von 
der Mark et al., 1985). 

Epithelial cell tumor invasion involves cells passing through various 
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connective tissues and associated basement membranes (Liotta et al., 
1983). Tumor cells elaborate degradative enzymes, one of which is the 
type IV collagenase (see Section VI), which enable them to move 
through connective tissues. Benign tumors are surrounded by a contin- 
uous basement membrane, whereas most malignant tumors have very 
fragmented or no basement membranes (Barsky et al., 1983). Col- 
lagenolytic activity of various tumor cell lines correlates well with 
their metastatic potential (Liotta et al., 1980). 


VIII. INTERACTIONS WITH CELLS AND OTHER PROTEINS 


Now that the major components of basement membranes have been 
isolated and characterized, attention is turning to both how they in- 
teract with each other to form a membrane and which components 
within the membrane are responsible for interaction with cells. Al- 
though it is generally believed that epithelial cells do not attach di- 
rectly to type IV collagen but utilize laminin as an attachment factor, 
a native type IV collagen binding protein called colligin has been iso- 
lated from the cell membrane of a variety of cell lines (Kurkinen et al., 
1984). It is a 47,000 Da glycoprotein which, it is speculated, either may 
be involved in the assembly of type IV collagen into a network or may 
be an intracellular component involved in the biosynthetic pathway 
that reaches the cell surface as a result of fusion of secretory vesicles 
from the Golgi with the cell membrane. Some cells do, however, attach 
readily and directly to type IV collagen, for example, aortic endothelial 
cells (Palotie et al., 1983), hepatocytes (Rubin et al., 1981), keratino- 
cytes (Murray et al., 1979), PF-HR9 cells (Engvall et al., 1982), and 
metastatic melanoma cells (Dennis et al., 1982). Asparagine-linked 
oligosaccharides on the cell surface have been implicated in this at- 
tachment (Dennis et al., 1984). 

Most attention has been paid to the interaction of type IV collagen 
with laminin and heparin sulfate proteoglycan. Laminin has a higher 
affinity for native type IV collagen than for denatured type IV or other 
collagen types, but it is nevertheless a weak affinity. Digesting the 
native type IV with pepsin, which removes the NC1 domain and about 
100 nm of the helix, destroyed laminin binding, indicating that the 
binding site resides in NC1 or close to it (Rao et al., 1985). Conversely, 
it was reported that isolated NC1 did not bind to laminin but pepsin- 
treated material did (Charonis et al., 1985). Using thrombin or pepsin 
fragments of laminin, it was shown that the type IV binding site 
resides in the globular regions of the three short arms of laminin 
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(Woodley et al., 1983; Charonis et al., 1985). When binding studies are 
carried out in solution, a precipitate is formed which contains approx- 
imately equimolar amounts of type IV and laminin (Kleinman et al., 
1983). 

Heparan sulfate proteoglycan also binds to both laminin and type IV 
collagen (Kleinman et al., 1983; Fujiwara et al., 1984; Woodley et al., 
1983). Precipitation studies using all three components resulted in a 
complex with a constant composition which contains type IV collagen, 
laminin, and heparin sulfate proteoglycan in molar ratios of 10:10:1, 
respectively, which are similar to those found in the EHS mouse tumor 
(Kleinman et al., 1983; Grant et al., 1985). 

This is probably a gross simplification of the in vivo situation as 
interactions may be greatly modified by other minor components of 
basement membranes such as nidogen, entactin, or the unknown epi- 
dermolysis bullosa (Woodley et al., 1984; Paller et al., 1985) and bull- 
ous pemphigold (Stanley ei al., 1981) antigens. There is also no infor- 
mation available about either the order in which basement membrane 
components are laid down or their spatial relationship to one another 
within a membrane. Although biochemical investigations would indi- 
cate otherwise, there are some indications, based upon observation of 
in vivo systems, that at least some basement membranes exhibit thix- 
otropic properties (Simpson, 1980, 1981). If this is indeed true, then 
interactions between various membrane components may be ex- 
tremely weak, making the demonstration of specific interactions be- 
tween the collagenous network and other components very difficult to 
define biochemically. 


IX. APPENDIX A: al(IV) AMINO ACID SEQUENCE 


Amino acid sequence of a large cyanogen bromide peptide located at 
the amino-terminal of human a1(IV)140 (positions 1-255) (Glanville, 
unpublished data), human a1(IV)95 (positions 226-1162) from Babel 
and Glanville (1984), and human NC1 from Pihlajaniemi et al. (1985) 
and Brinker et al. (1985a), whose results were in complete agreement. 
The amino acids shown below the main sequence are substitutions 
found in the corresponding sequence of mouse P1 (Schuppan et al., 
1982) and NC1 (Oberbäumer et al., 1985). P1 was corrected according 
to the results of Oberbäumer et al. (1985). (*, Indicates hydroxylated 
proline and lysine residues; +, indicates an asparagine-linked carbo- 
hydrate; and ( ), enclose nontriplet sequences.) 
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155 160 165 170 175 180 
* 


* * * * * * 

PGIPGTPGPPGLPGLQGPVGPPGFTGPPGP 
185 190 195 200 205 210 

* * * * * 

PGPPGPPGEKGQMGLSFQGPKGODKGOD Q<GV K 
215 


EPPKVP....> 


X. APPENDIX B: «@2(IV) AMINO ACID SEQUENCE 


Amino acid sequence of mouse P2 (positions 1-511) (from Schwartz 
et al., 1986) and NC1 (positions 512-738) (unpublished data kindly 
supplied by K. Kühn). Symbols as in Section IX. 


125 130 135 140 145 150 
* * * * * 
DTVDLPGSPGLKGERGITGIPGLKGFFGERK 
155 160 165 170 175 180 
* x * x 
GAAGDIGFPGITGMAGAQGS PGLKGQTGFP 
185 190 195 200 205 210 
* * * x * 
GLTGLQGPIGEPGRIGIPGDKGDFGWPGVP 
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695 700 705 710 715 720 
CHYFANKYS FWLTTIPEQNFQSTPSAODTLK 


725 730 735 
AGLIRTHISRCOVCMKNL 


This amino acid sequence corresponds to the long form of human 
a2(IV) 7 S (Glanville, unpublished data). 


5 10 15 20 25 30 
x x 
<K FDVPCGGRDCSGGCQCYPEKGGRGQPGPV 
35 40 45 50 55 60 
x * x x 


155 160 165 170 175 180 


185 190 195 200 205 210 


GPPGPPGPPGQQGNRGLGFYGVKGEKGDVGQ 
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I. INTRODUCTION 


This review is focused on the biosynthesis of collagen V, the struc- 
ture of the procollagens V, and their processing. In addition, an at- 
tempt is made to summarize current information regarding the fiber 
form of collagens V, their occurrence in various connective tissues, and 
their susceptibility to degradative enzymes. Originally, this collagen 
type was reported as AB-collagen, which had been extracted from 
tissues after pepsin digestion (Burgeson et al., 1976; Chung et al., 
1976). This material was then named type V collagen (Bornstein and 
Sage, 1980). As pepsin digestion removes significant noncollagenous 
peptide regions, we refer to such preparations as collagen V (pepsin), 
in contrast to the term collagen V which implies the end stage of 
physiologically processed material. In most of the literature this dis- 
tinction is not made. Collagen V is also a generic term for all variants 
of this collagen type. Reports of amino acid analyses and sequences are 
mostly on pepsin-extracted material (Bornstein and Sage, 1980). Al- 
though these analyses showed distinct differences from the fibrous 
collagens I, II, and III, which are also evident in other ways, type V 
collagen is frequently associated with collagens I and III. 


II. COMPOSITIONS OF COLLAGENS V 


Four chains have been described for collagen V: a1 and al’, a2, and 
a3 (Bornstein and Sage, 1980). Two related collagen chains were iso- 
lated from cartilage and named lao and 2a (Burgeson and Hollister, 
1979). They are regarded as distinct gene products from al(V) and 
a2(V) chains and are structurally different, though they have substan- 
tial similarities in amino acid and carbohydrate composition, sol- 
ubility properties (Reese and Mayne, 1981; Burgeson et al., 1982), and 
susceptibility to proteases (Liotta et al., 1982), and show immunologi- 


81 


STRUCTURE AND FUNCTION OF Copyright © 1987 by Academic Press, Inc. 
COLLAGEN TYPES All rights of reproduction in any form reserved. 


82 JOHN H. FESSLER AND LISELOTTE I. FESSLER 


cal cross-reactivity (von der Mark and Öcalan, 1982; Gay et al., 1981a). 
The a(V) chains are defined by the pattern of peptides obtained after 
cyanogen bromide cleavage of the pepsin-derived chains as well as by 
other properties (Bornstein and Sage, 1980). The principal, widely dis- 
tributed chains are a1(V) and «2(V). Several methods have been devel- 
oped for the identification and quantitation of these chains in differ- 
ent tissues, based on the characteristic peptides formed by cleavage 
with cyanogen bromide and proteases (Rhodes and Miller, 1979, 1981; 
Rhodes et al., 1981; Miller et al., 1983; von der Mark and von der Mark, 
1979; Tanaka et al., 1981; Cole and Chan, 1981; Kelly et al., 1981; Sage 
et al., 1981a; Reiser and Last, 1983; Bellon, 1985). 

A homotrimeric molecule of al(V) chains was found in a hamster 
lung cell line (Haralson et al., 1980, 1984; Haralson and Mitchell, 
1981) and is said to exist in liver (Biempica et al., 1980). In vitro 
reconstitution experiments of pepsin-derived chains indicated that a 
corresponding homotrimer of «2(V) chains would be unstable (Bentz et 
al., 1978). Initially, various combinations of a1(V) and a2(V) chains 
were proposed according to the varied recovery of these chains from 
tissues, though the heterotrimer (a1),.02 was particularly likely (Bur- 
geson and Hollister, 1979; Rhodes and Miller, 1978, 1981; Deyl et al., 
1979; Sage et al., 1981b; Trelstad et al., 1981; Madri et al., 1982; Abedin 
et al., 1982). A disulfide-linked form of the corresponding three precur- 
sor chains, (proa1),proa2(V), was found in small amounts (Kumamoto 
and Fessler, 1980). From tissues in which the additional «3(V) chain 
occurs, the heterotrimers (a1),02(V) and ala2a3(V) have been sepa- 
rated and the stoichiometry of their components demonstrated 
(Rhodes and Miller, 1981; Abedin et al., 1982; Niyibizi et al., 1984; van 
der Rest et al., 1985). These investigators failed to find the homotrimer 
(a3)3(V), which has been reported (Madri et al., 1982). Experiments 
assessing the susceptibility of these molecules to protease digestion 
also confirm the existence of heterotrimers (Sage et al., 1981b). The 
different chain compositions impart slightly different stabilities and 
flexibilities to the molecules. For example, while the (a1),02(V) het- 
erotrimer melts at 39°C, the ala2a3 heterotrimer dissociates at 36°C 
(Niyibizi et al., 1984), and the latter combination displays a greater 
flexibility as determined by analysis of rotary shadowed molecules (J. 
Rauterberg et al., personal communication). In different tissues, type 
V molecules with different compositions may be better suited for their 
particular functions. 

An additional chain was obtained from chick embryo tendon and 
initially named «4(V) (Fessler et al., 1985b), and then al’(V) because 
of the similarity of its properties to the al1(V) chain (Fessler et al., 
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1985a). An equivalent chain has been described in human epitheloid 
lung cells (Bloch et al., 1986). The precursor forms of the al(V) and 
al’(V) chains and their physiologically derived pa- and a-chains differ 
distinctly from each other in the substantial noncollagenous peptide 
regions that are retained at one end of the chains. In the al'(V) series, 
this region is smaller and differs in peptide fragments which carry 
sulfated tyrosine residues and in glycosylation (Fessler et al., 1985a, 
1986b). The noncollagenous peptide regions at the other end of the 
main collagen domain are of similar size but differ in their propensity 
to form disulfide cross-links to companion chains in the trimeric pro- 
collagen molecules. After pepsin treatment, it has not been possible to 
separate the al(V) (pepsin) and «1’(V) (pepsin) chains. The cyanogen 
bromide peptide pattern of such mixed a(V) (pepsin) chains did not 
show clear differences. We suggest that the proal’(V) and proal(V) 
chains are either the products of two similar genes, which differ pri- 
marily in the noncollagenous peptide regions, or that they arise from 
one gene by different splicing of RNA. The proa1'(V) chains associate 
into a homotrimer and into the heterotrimer (proal proal’ proa2)(V). 


III. NUCLEOTIDE SEQUENCES OF PROCOLLAGENS V 


Recombinant human cDNA sequences were first correlated with the 
CB9 cyanogen bromide peptide in the carboxyl-end collagen region of 
a2(V) (pepsin) (Myers et al., 1985a). The pattern of charged groups 
found in the «l(D, «2(D), and a@1(III) chains is strongly conserved in 
this region, and the results showed that these three interstitial col- 
lagen chains are more similar to each other than to a2(V). The region 
918-944 lacks any prolines, a finding that had also been made on 
al(I). The telopeptide lacks lysine residues and therefore no lysine- 
derived cross-links could occur in this region of this chain. The DNA 
sequences were then extended in the 3’ direction to cover all of the 
carboxyl propeptide (Myers et al., 1985b). The carboxyl propeptide of 
a2(V) has a greater degree of similarity to the corresponding domains 
of the interstitial procollagens than to the basement membrane pro- 
collagen IV. The carboxy] propeptide has the same number of cysteine 
residues as in proa2(I), and most are at the same sites. There is also a 
carbohydrate attachment site at the same position as in the interstitial 
procollagens, and there may be an additional carbohydrate attachment 
site within the hypervariable N-terminal region of this carboxyl] pro- 
peptide. With this cloned DNA, the human proa2(V) gene was located 
on chromosome 2 at q24.3 to q31, a site where the gene for proal (III) is 
also located (Emanuel et al., 1985). 
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Fic. 1. Diagram of structure of proa(V) chains and their derivatives in chick embryo. 
The central collagenous sequence is indicated as a thin line and is assigned a nominal 
molecular size of 100 kDa. The nominal molecular sizes of the flanking, noncollagenous 
sequences are indicated. The proa2(V) chain may have an additional, small amino- 
terminal sequence that would have been missed by the analytical techniques that were 
used. By analogy with the interstitial procollagens, there could also be short, terminal 
collagenous sequences separated from the main collagen helix by telopeptide regions. 
The carboxyl end of each peptide is presumed to be at the right side of the diagram, 
though this is only known for the proa2 chain. Upon conversion of proa2(V) to pa2(V) 
two or more peptides, called P-peptides, become disulfide linked to the noncollagenous 
region of pa2(V). The chains occur in various trimeric combinations with different 
degrees of interchain disulfide linkage, as given in the text. These covalent connections 
between the main polypeptide chains are lost on conversion to p-collagens (V). 


IV. STRUCTURE OF PROCOLLAGENS V 


The domain structure of procollagens V is equivalent to that of the 
interstitial collagens (Fig. 1). A central collagen helix of practically 
the same length as that of type I has knoblike projections of non- 
collagenous propeptides at each end, as shown by electron microscopy 
of sprayed, rotary shadowed molecules (Bächinger et al., 1982; Fessler 
et al., 1982). The two knobs have different diameters. After physiologi- 
cal processing, a small knob is still seen at one end of the molecule 
(Elstow and Weiss, 1983; Broek et al., 1985), and presumably this 
corresponds to a residual portion of the original amino propeptide. 
However, at present the only definitive structural location of the type 
V propeptides is the above recombinant DNA sequence of the carboxyl 
propeptide of proa2(V). This sequence corresponds to the noncol- 
lagenous peptide of approximately 40 kDa, as judged by SDS--poly- 
acrylamide gel electrophoresis, that is formed from pro«a2(V) chains 
upon digestion with bacterial collagenase (Kumamoto and Fessler, 
1981). Slightly smaller sized, equivalent noncollagenous peptides are 
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obtained from proal(V) and proal'(V), with nominal molecular size of 
35 kDa (Kumamoto and Fessler, 1981; Fessler et al., 1985a). (These 
authors rounded off all nominal molecular sizes to the nearest 5 kDa.) 
This suggests that a trimeric carboxyl propeptide simlar to that of the 
interstitial procollagens exists. However, there are also some dif- 
ferences. The diameter of the knoblike carboxy] propeptides of sprayed, 
rotary shadowed procollagen I was intermediate between the diameters 
of the two procollagen V projections (Bachinger et al., 1982). The in- 
terchain disulfide linking of the carboxyl V propeptides is usually 
incomplete, and the peptide connection to the collagen helical portion of 
the proal(V) and proal'(V) chains is cleaved much earlier than that of 
proa2(V) (Kumamoto and Fessler, 1981). It is assumed that in the type 
V collagen helix the amino-to-carboxyl sense of the constituent pep- 
tides is parallel. 

The molecular masses of the procollagens V assigned from elec- 
trophoretic mobilities of these hybrid proteins are subject to error, as 
the collagen and noncollagen peptide regions contribute differently to 
the net mobilities. Glycosylation and internal disulfide links cause 
additional anomalies. However, within a precursor-to-product related 
series of peptides, defined by pulse-chase experiments, there is good 
correlation of electrophoretic mobilities and ultracentrifuge sedimen- 
tation velocities, supporting the concept of sequential proteolytic pro- 
cessing of precursors (Fessler et al., 1981a). There are a number of 
reports of large, recently formed potential precursors of collagen V 
(pepsin) chains (e.g., Linsenmayer and Little, 1978; Jimenez et al., 
1978; Kumamoto and Fessler, 1980, 1981; Foidart et al., 1981; Alitalo 
et al., 1982; Narayanan and Page, 1983; Kao et al., 1983; Elstow and 
Weiss, 1983). The nominal size of the human proal(V) chain is 220- 
240 kDa and of the proa2(V) chain is 150-170 kDa (Alitalo et al., 1982; 
Narayanan and Page, 1983). Differences may only be due to experi- 
mental conditions and the molecular-weight standards that were used. 
The summed values for the sizes of the collagen helical portions and 
the propeptides in chick embryo are about 220 kDa for the proal(V) 
chain and 140 kDa for prow2(V) (Kumamoto and Fessler, 1981), and 
the relationships between these values agree with the relative sedi- 
mentation coefficients of the denatured chains (Fessler et al., 1981a). 
The proal(V) chain of hamster lung cells is the same size, 220 kDa 
(Fessler et al., 1981b). These determinations were made on chains that 
were identified by peptide mapping to correspond with the previously 
isolated «1(V) and «2(V) (pepsin) chains. When hydroxylation and the 
accompanying O-linked glycosylation were prevented by synthesis in 
the presence of 2,2'-dipyridyl, the apparent size of the proa1(V) chain 
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was decreased from 220 to 200 kDa, and the proa2(V) chain from 150 
to 145 kDa (Alitalo et al., 1982). A separate small decrease of apparent 
size occurred when attachment of N-linked carbohydrate was inhib- 
ited by incubation in the presence of tunicamycin (Fessler et al., 
1985a). The chick proal’(V) chain is about 15 kDa smaller than the 
proal(V) chain, and this difference is ascribed to a decrease in the size 
of the presumed amino propeptide. Digestion with bacterial col- 
lagenase showed that the chick proal(V) and proal'(V) chains have 
large noncollagenous peptides at both ends of the collagen region 
(Kumamoto and Fessler, 1981; Fessler et al., 1985a). Assignment of 
the 35 kDa noncollagenous peptides as carboxyl propeptides, by anal- 
ogy with the proa2(V) chain, leaves 85 and 70 kDa noncollagenous 
peptides as amino propeptides of, respectively, proal(V) and 
proal’(V). In chick, the proa2(V) chain lacks a large noncollagenous 
peptide at the amino end such as could be detected by the poly- 
acrylamide gels used in these investigations. However, either a short 
collagenous region, as in proa2(I), or some additional, smaller se- 
quences of amino acid residues might, of course, occur. 


V. BIOSYNTHESIS 


Embryonic chick tendons synthesise proal(V) and prow1'(V) chains 
in about 10-12 min at 37°C (Fessler et al., 1986b) (Fig. 2). This is 
consistent with the synthesis times of pro«1(I) and pro«1(IV) and their 
relative electrophoretically determined sizes. Within each propeptide, 
intrachain disulfide links are formed first, as judged by changes in 
electrophoretic mobility on reduction. Assembly of trimeric molecules 
is relatively slow (Kumamoto and Fessler, 1981; Fessler et al., 1981a; 
Alitalo et al., 1982). In human rhabdomyosarcoma A-204 cells, the 
first interchain disulfide-linked forms appeared 30 min after a 5-min 
pulse. Eventually, all of the prow2(V) chains formed disulfide-linked 
oligo/multimers of prowl(V) and proa2(V) chains which were not fur- 
ther characterized. Some proal(V) chains were not disulfide linked, 
and the proal(V) was present in excess of 2:1 over the proa2(V) chain 
(Alitalo et al., 1982). Human gingival fibroblast cultures also formed 
procollagen V molecules with interchain disulfide links, but the de- 
gree of linkage was not determined (Narayanan and Page, 1983). The 
(proa1),(V) made by hamster lung cells lacked any interchain disul- 
fide bonds (Fessler et al., 1981b). In contrast, the homotrimer 
(prow1'),(V) made by chick embryo tendon is fully interchain disulfide 
linked (Fessler et al., 1985a,b). Mostly incomplete disulfide linkage 
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Fic. 2. Temporal relationships of biosynthesis and modifications of procollagens V in 
chick embryo tissues. Formation of individual proa (V) chains occurs in the rough 
endoplasmic reticulum (RER) and takes about 10 min. The indicated modifications of 
these chains and their triple-helical assembly are also assigned to the RER by analogy 
with other procollagens. Chick tendon procollagen V molecules are various trimeric 
combinations of pro«2(V) and the similar, but different, proal(V) and proa1'(V) chains. 
The procollagen V molecules with N-linked carbohydrate that is degraded by endo- 
glycosidase H (EndoH sensitive) are assigned to RER and the cis Golgi compartment. 
Sulfation of tyrosine residues is assigned to some other Golgi compartment, possibly the 
trans stack, and occurs about 50 min after completion of peptide synthesis. The sulfated 
procollagen V molecules are secreted by cell cultures. Extracellularly, the carboxy] 
propeptides are considered to be cleaved off first, and the resulting sulfated pal(V) 
chains are formed more rapidly than their pal'(V) counterparts. Several hours later, 
distal portions of the remaining, large noncollagenous domains are removed, but most of 
the sulfated tyrosine residues are retained in the residual noncollagenous domains of 
collagens (V). Further processing may occur on residence in the tissues. The unsulfated 
procollagen V molecules elute in a low-salt fraction from DEAE-cellulose, while all their 
derivatives elute later in a high-salt fraction. 


was found in the trimeric procollagen molecules made by various chick 
tissues. Attempts to increase the time of intracellular residence of 
these procollagen molecules did not increase the degree of disulfide 
linkage (Kumamoto and Fessler, 1981; Fessler et al., 1981a). Although 
reversal of existing disulfide bridges by disulfide exchange during 
extraction cannot be excluded, this was guarded against, and the pro- 
collagens I and III that were coextracted were normally disulfide 
linked. 

The procollagen V molecules move through the cell in the sequential 
order in which they are synthesized and undergo further modifications 
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in the Golgi organelle, probably in the trans-Golgi, 50 min after the 
completion of polypeptide synthesis. The N-linked carbohydrate 
changes from having been sensitive to endoglycosidase H to being 
resistant to this enzyme, though it is still cleaved by endoglycosidase F 
(Fessler et al., 1986b). Several tyrosine residues become O-sulfated, 
and this is accompanied by changes in the charge of the molecules 
(Fessler et al., 1986a,b). Within experimental limits, all of the sulfate 
bound to procollagens V is attached in this form and is not on carbohy- 
drate. About 40% of the tyrosine residues of the amino propeptide of 
proal(V) and proal’(V) chains become sulfated, with the two chains 
giving different maps of tyrosine sulfate-labeled peptide fragments 
after cleavage with proteases. The sulfated tyrosine residues are near- 
ly all in the presumed amino propeptides, and most or all are in those 
portions of these noncollagenous regions which are retained during 
physiological processing to collagens V. The prow2(V) chain is only 
weakly tyrosine sulfated, and this is consistent with its lack of a large 
amino propeptide. The tyrosine sulfation proceeds independently of 
the attachment of N-linked carbohydrate, i.e., it continues when this 
has been suppressed with the drug tunicamycin. A cell-free system of 
Golgi-enriched membranes, together with the sulfate donor 3’-phos- 
phoadenosine 5’-phosphosulfate (PAPS), sulfated tyrosine residues 
both in endogenous and in added unsulfated procollagens V. The en- 
zyme is presumably membrane bound, and detergent disruption of the 
vesicles is required for added procollagen to be sulfated. All of the 
procollagen V becomes sulfated, but only shortly before secretion and 
cleavage to p-collagens V. Therefore, most of the extracted procollagen 
V is not sulfated. This tyrosine sulfation of procollagens V occurs in all 
tested chick embryo tissues, in neonatal rat, and in cultured human 
cells. The antimicrotubular drug Nocodazole speeds up tyrosine sulfa- 
tion of procollagens V without influencing the final degree of sulfation 
(Fessler et al., 1986a,b). 


VI. SECRETION OF PROCOLLAGENS V AND PROCESSING 


All of the procollagens V and their derivatives found in culture 
media of human and chick cells were tyrosine sulfated. Secretion and 
cleavage of procollagens V rapidly follow sulfation (Fessler et al., 
1986a,b). As with the interstitial procollagens, secretion is inhibited 
when hydroxylation of the procollagens V has been prevented by syn- 
thesis in the presence of 2,2’-dipyridyl (Alitalo et al., 1982). Also, as 
with the interstitial procollagens, addition of arginine to the culture 
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medium slows the cleavage conversion to p-collagens V (Kumamoto 
and Fessler, 1981; Fessler et al., 1981b). The kinetics of secretion from 
corneal cells have been studied (Kao et al., 1983), and a secretion time 
of about 100 min was estimated for rhabdomyosarcoma cells (Alitalo et 
al., 1982). Presumably, cleavage to p-collagens V occurs at the cell 
surface or extracellularly, and cell culture fluids contain enzymes that 
cleave procollagens V (Fessler et al., 1981b; Alitalo et al., 1982; Nara- 
yanan and Page, 1983). 

In whole chick tendons, conversion to p-collagens V occurs about 60 
min after completion of polypeptide synthesis (Fessler et al., 1986b). 
The secreted procollagens V contain trimeric molecules of different 
chain compositions, e.g., homotrimers or heterotrimers consisting of 
two or three different pro«(V) chains. During the conversion to p- 
collagen V, the constituent chains are cut at different rates (Kuma- 
moto and Fessler, 1981; Fessler et al., 1981b, 1986b), and overall the 
proal(V) chains are cut about three times faster than the proal’(V) 
chains. Thus, several forms of trimeric molecules will exist in which 
one or more polypeptide strands are cut without loss of cleavage prod- 
ucts from the original procollagen V molecule. Equivalent intermedi- 
ates have been described in the conversion of procollagen I to p-col- 
lagen I (Davidson et al., 1977; Morris et al., 1979). However, there are 
significant differences in the processing of procollagens V and pro- 
collagen I, and additional changes must occur. We shall first describe 
the changes observed for each proa(V) chain, and then discuss the 
possible three-chained molecular forms. 

The smaller, 35 kDa noncollagenous peptide is split off the proal(V) 
and proal’(V) chains. Presumably, this is the carboxyl propeptide. The 
resulting pal(V) and pal’(V) chains are not disulfide linked to any 
other chains. This processing of pro«l(V) occurs both in chick tissues 
(Kumamoto and Fessler, 1981; Fessler et al., 1985a) and in human cell 
cultures (Alitalo et al., 1982; Narayanan and Page, 1983; Bloch et al., 
1986). The conversion of prow2(V) to pa2(V) may be slightly different 
in human and chick. In human gingival cell cultures, three chains of 
the «2(V) series were found: 174, 157, and 132 kDa. Presumably, these 
correspond to the proa2(V), pa2(V), and «2(V) chains, though this se- 
quential relationship was not established (Narayanan and Page, 
1983). In chick, the reduced forms of the proa2(V) and pa2(V) chains 
have the same electrophoretic mobility and give rise to the same non- 
collagenous peptide upon digestion with bacterial collagenase. Thus, 
the peptide structure of chick proa2(V) is presumably unchanged, and 
the carboxyl propeptide is retained in pa2(V). However, both elec- 
trophoretic and velocity sedimentation analysis of the pa2(V) chain 
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without reduction show it to be larger than the original proa2(V) 
chain. This is because more than one copy of an approximately 30 kDa 
peptide has become disulfide linked to the carboxyl propeptide of 
pa2(V). Reduction releases these peptides, which were called P-pep- 
tides, and yields the pa2(V) chain. Collagenase digestion of the non- 
reduced pa2(V)—P-peptide complex gives the carboxyl—a2(V) peptide 
disulfide linked to the P-peptides (Kumamoto and Fessler, 1981; Fes- 
sler et al., 1981a). 

The most likely sources of the P-peptides are the cleaved-off carbox- 
yl propeptides of proal(V) and proal’(V) chains. The P-peptides elec- 
trophorese slightly faster than the presumed carboxy] propeptides pre- 
pared from proal(V) chains by digestion with bacterial collagenase. 
This is consistent with an expected difference between the likely phys- 
iological site of cleavage, in a telopeptide region, and the place where 
bacterial collagenase will stop digestion when it reaches the end of the 
collagenous sequence. Both carboxyl propeptides and P-peptides carry 
N-linked carbohydrate, and both become labeled during incubation 
with radioactive amino acids. However, if the P-peptides are derived 
from the carboxyl propeptides of adjacent proal(V) and proal'(V) 
chains, then an increase of interchain disulfide bonding must accom- 
pany the procollagen V to p-collagen V transition. The P-peptides 
could be carboxyl propeptides which are unable to undergo disulfide 
linkage to neighboring carboxyl propeptides as long as they retain 
continuity with the main collagenous peptide portion and, only after 
cleavage, could undergo some rearrangement needed for disulfide link 
formation. However, when the proal(V) chains of isolated procollagen 
V were cleaved to pal(V) with the enzymes of fibroblast culture media, 
none of the resulting pa2(V) chains had P-peptides attached (Fessler et 
al., 1981b). It is not necessary that all P-peptides originally belonged to 
the set of chains of one trimeric molecule. They could be derived from 
an adjacent, juxtaposed molecule. These considerations apply equally 
if the nroal(V) and proa2(V) chains within one molecule had an anti- 
parallel arrangement. 

The conversion of p-collagens V to collagens V only occurs slowly, 
over a number of hours, in chick tissues incubated in vitro (Fessler et 
al., 1981a,b, 1985a). This conversion is strikingly different from the 
corresponding change in the interstitial p-collagens I, II, and III. Only 
the distal portion of the carboxyl propeptide of pa2(V) is cut off, so that 
an approximately 20 kDa noncollagenous peptide remains attached to 
the main collagen region. With this change, the P-peptides are lost, 
and they are, presumably, disulfide linked to the distal portion of the 
carboxyl propeptide. The presumed amino propeptides of pal(V) and 
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pal’(V) also have their distal portions removed, leaving substantial 
peptides of, respectively, 45 and 30 kDa that are resistant to bacterial 
collagenase and remain continuous with the main collagen regions. 
Most or all of the sulfated tyrosine residues are retained during this 
conversion. While native procollagen V and p-collagen V molecules 
could not be distinguished by velocity sedimentation, the collagen V 
molecules formed by the above changes sedimented more slowly. This 
implies that while the conversion from procollagen V to p-collagen V 
may primarily represent nicks in peptide strands and changes in disul- 
fide linkage of the originally secreted triple-chained molecule, the 
further change to collagen V removes some peripheral, noncollagenous 
portions at both ends of this molecule. At the time that this change 
occurs, these molecules have been a part of the extracellular matrix 
for a relatively long time. 

Further, slow processing may occur at times longer than the 20 hr of 
organ explants maintenance. Extracts of chick embryo tendons show 
two corresponding smaller electrophoretic bands which chro- 
matograph with collagen V and stain with Coomassie Blue but do not 
incorporate radioactive amino acids during 20 hr of incubation. The 
nature of these materials has not yet been established. From lathyritic 
chick tendon (Jimenez et al., 1978; Broek et al., 1985), bone, and crop 
(Broek et al., 1985), collagen V has been extracted that probably corre- 
sponds to this material. The lathyritic «1(V) and «2(V) chains contain, 
respectively, 18 and 29 kDa noncollagenous regions. This lathyritic 
collagen V was then associated into segment long spacing (SLS) 
crystallites that had the same 300-nm-long banded region as that 
made from collagen V (pepsin) and had an additional 25-nm domain. 
Comparison with SLS crystallites of collagen I indicated that the addi- 
tional domain is atthe amino-terminus. In rotary shadowed molecules, 
the 304-nm collagen thread had a knob at one end, and several col- 
lagen molecules were connected by one large knob (Broek et al., 1985). 
Similar collagen V fractions were isolated from calf skin (Elstow and 
Weiss, 1983) and from lightly pepsin-digested placenta (Adachi and 
Hayashi, 1985). An unresolved paradox is that the single amino do- 
main of collagen V molecules seen in the electron microscope is pre- 
sumed to contain the noncollagenous regions of both the a1(V) and the 
a2(V) chains. But the single, substantial noncollagenous domain of 
chick prow2(V) (Kumamoto and Fessler, 1981; Fessler et al., 1981a) is 
assigned as a carboxy] propeptide because of the existence of a carbox- 
yl propeptide of this size in a human cDNA sequence (Myers et al., 
1985b). Thus, this noncollagenous region of prow2(V) and its processed 
derivative in w2(V) are assigned to opposite ends of one molecule. It is 
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not established what the end form of type V collagen molecules is, and 
there could be more than one form in different tissues. 


VII. LOCATION OF COLLAGENS V IN THE 
EXTRACELLULAR MATRIX 


Immunofluorescence microscopy shows, in a wide variety of tissues, 
that collagen V occurred both in association with basement mem- 
branes and, quite separately, in fibrous stroma such as tendon and 
cornea. Recently, the basement membrane localization has been ques- 
tioned, and identification of collagen V as an interstitial fibrous col- 
lagen has been suggested (Schuppan et al., 1986). Investigation of the 
relationship of collagen V to basement membranes requires resolution 
with the electron microscope. Initial immunoelectron microscopy by 
the ferritin—ultrathin frozen section technique showed colocalization 
of collagens V and IV in three different basement membranes of the 
mouse kidney, and their occurrence in parts of the mesangium as well 
(Roll et al., 1980; Furthmayr et al., 1982). Different results were ob- 
tained using immunoelectron microscopy with cross-adsorbed anti- 
bodies and the peroxidase technique. The results suggested that type V 
collagen is present in the rat renal interstitium in different forms: in 
close apposition to interstitial collagen fibers, in the stromal aspect of 
vascular basement membranes, and in a particulate material not 
bound to other structures. Type V collagen was specifically noted as 
being absent from epithelial and glomerular basement membranes. It 
was sometimes found on only the stromal aspect of vascular smooth 
muscle as well as in between collagen bundles as part of the matrix. A 
binding or connecting function was postulated for collagen V (Mar- 
tinez-Hernandez et al., 1982). In a third study, application of the im- 
munogold-labeling technique to human amnion showed type V col- 
lagen in the immediate vicinity of basement membranes but distinctly 
separate from the locations of laminin and collagen IV. Unbanded 
fibrils of 12 nm were stained. These were separate from the striated, 
67-nm fibers that were labeled by antibodies to collagen I, though 
these different fibrils were enmeshed. An anchoring function between 
membranes and stromal matrix was postulated for these 12-nm col- 
lagen V fibrils (Modesti et al., 1984). 

The relationship of collagen V to muscle cells is of particular in- 
terest. During formation of myotubes in vitro, myoblasts make small 
amounts of collagen V, but pronounced expression of collagen V occurs 
when they have fused (Bailey et al., 1979; Sasse et al., 1981; von der 
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Mark and Öcalan, 1982; Nusgens et al., 1986). Collagen V is primarily 
found in the endomysium of skeletal muscle (Duance et al., 1977; 
Bailey et al., 1979) and was undetected within the synaptic regions of 
muscle basement membranes (Sanes and Cheney, 1982). In contrast, 
cultures of smooth muscle cells synthesize and secrete collagen V over 
the entire plasma membrane (Gay and Miller, 1978), but within 
tissues collagen V tends to be associated with only the stromal aspects 
of the cell surface (Gay et al., 1981b; Miller et al., 1982). 

Collagen V has also been reported peripherally adjacent to capillar- 
ies, at the surface of blood vessels denuded of their endothelial lining 
(Keremyi et al., 1985), and in atherosclerotic plaques (McCullagh et al., 
1980; Ooshima, 1981; Shekhonin et al., 1985). The overall impression 
and interpretation of such findings is that, in these locations, collagen 
V has one or more functions as a connector between basement mem- 
brane and stroma. 

Partly in contrast to such functions is the concept of collagen V as a 
component of extended collagen fibers. Purified collagens V (pepsin) 
can be precipitated as thin fibrils (Chiang et al., 1980; Broek et al., 
1985), even with 67-nm periodicities (Barnes et al., 1980; Adachi and 
Hayashi, 1985). The central collagen domain has the same 300-nm 
length in collagens I and V (Kühn, 1982; Bächinger et al., 1982; Elstow 
and Weiss, 1983; Broek et al., 1985). Monoclonal antibodies against 
collagens V (pepsin) have been made (von der Mark and Öcalan, 1982; 
Linsenmayer et al., 1983a; Mayne et al., 1984b), and it was found that, 
at the light-microscopic level, the epitopes appeared to be masked in 
tissues by association with collagen I. They were unmasked by acetic 
acid swelling of the fibers, low temperature dissolution of lathyritic 
material, and cleavage of collagen I with vertebrate collagenase (Lin- 
senmayer et al., 1983a, 1984, 1985; Fitch et al., 1984). These data 
suggested two separate models for fibers containing more than one 
collagen type: in one model the different components are intimately 
mixed throughout the whole cross section of a fiber, and in the other 
there are regional condensations of the separate types. In the latter 
case the type V collagen contributions would be located more centrally 
within a fiber. Either combination is likely to imbue a fiber with 
special properties (Linsenmayer et al., 1985). In addition to bio- 
mechanical properties such as tensile strength and flexibility of a 
fiber, such combinations could also influence final fibril diameter and 
the processes of fibrillogenesis (Adachi and Hayashi, 1986). Fibers 
which contain collagen V in the chick embryo cornea only appear dur- 
ing a second stage of fiber formation; therefore, in this tissue the 
collagen V does not seem to be involved in the additional biological 


94 JOHN H. FESSLER AND LISELOTTE I. FESSLER 


problem of how the first lead fibers are laid down. Polyclonal anti- 
bodies against collagen V (pepsin) indicated that, at the electron-mi- 
croscope level, this material was on or close to collagen fibrils of the 
stroma of human cornea and in Bowman’s layer (Nakayasu et al., 
1986). 

In chick embryo tendon, the proa1’(V) chain is only made within the 
tendon proper and not by the cells of the tendon sheath, which do, 
however, make proal(V) and proa2(V) chains (Fessler et al., 1985a). 
The proa3(V) chain is primarily found in placenta and in some other 
tissues (Brown et al., 1978; Sage and Bornstein, 1979; Brown and 
Weiss, 1979; Abedin et al., 1981; Morton and Barnes, 1982; Leushner, 
1983; Leushner and Haust, 1984). These occurrences could be further 
indications of specialization of collagen V fiber formation for particu- 
lar regions. 

The potentially intimate molecular association of collagens V with 
collagen I is a property reminiscent of collagen III, which probably also 
is rarely, if ever, found alone. The presence of amino propeptides on 
pN-collagens I and III clearly influences the size of the fibers deposited 
(Fleischmajer et al., 1981, 1985; Hulmes, 1983), and a similar role for 
pN-collagen V may be postulated. Both procollagens V and III are also 
only slowly processed at their amino ends (Fessler et al., 1981c), and 
both have sulfated tyrosine residues (Fessler et al., 1986a,b; Niemela 
et al., 1985; Jukkola et al., 1986). There is an impression that both 
collagen types are also associated with fibers of thinner diameter, but 
it is probably premature to generalize. 


VIII. OTHER PROPERTIES OF COLLAGENS V 


In addition to biomechanical contributions, collagen V (pepsin) has 
been tested for its interactive properties with cells and other compo- 
nents of the extracellular matrix. It has been reported to promote cell 
attachment and migration (Stenn et al., 1979; Grotendorst et al., 1981), 
and specifically cyanogen bromide peptide CB6 of the a2(V) chain 
serves as a binding site to some plasma membrane glycoproteins (Leu- 
shner and Haust, 1985). Type V collagen interacts preferentially, rela- 
tive to other collagens (pepsin), with thrombospondin (Mumby et al., 
1984). The extensive literature on the interaction of collagens with 
platelets, which includes interaction with collagens V, (e.g., Chiang et 
al., 1980; Barnes et al., 1980), and the role of collagens in blood clot 
formation will not be reviewed here. 

The collagen helix of collagens V is resistant to the vertebrate col- 
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lagenases which cleave collagens I and III in a characteristic manner 
(Harper, 1980). Other enzymes, with various degrees of specificity for 
collagens V (pepsin), have been described (Mainardi et al., 1980, 1984; 
Liotta et al., 1981; Murphy et al., 1981, 1982, 1985; Hibbs et al., 1982; 
Rich et al., 1983; Haralson et al., 1984). A class of these neutral metal- 
loproteinases, termed “gelatinases,” specifically degrade collagen V in 
addition to denatured collagens (Murphy et al., 1981, 1982; Mainardi et 
al., 1983; Hibbs et al., 1985), and gelatinases are often present during 
matrix degradation together with the interstitial collagenases (Seltzer 
et al., 1981; Harris and Krane, 1972; Sopata and Dancewiez, 1974), 
suggesting that they may act synergistically with the classical col- 
lagenases during matrix remodeling. The regions flanking the col- 
lagen V helix must be important for maintaining the state of insol- 
ubility of collagens V in tissues from which they are extracted after 
digestion with pepsin. It is likely that some of the pathologically deg- 
radative enzymes might act at such sites. There may be turnover of 
collagens V associated with inflammatory conditions and wound heal- 
ing (Hering et al., 1983; Kurita et al., 1985). The synthesis of collagen 
V by gingival cells is increased during inflammation, but this condi- 
tion does not become an inherent property of the cells that is main- 
tained in culture (Narayanan and Page, 1983, 1985). 


IX. OCCURRENCE OF COLLAGENS V 


We hope that the following list of locations in which collagens V 
have been found or investigated in normal adult and embryonic 
tissues and in some pathological conditions will help the reader, and 
we apologize to the authors of any additional publications that we may 
have omitted. 

Vascular tissues and cell cultures: Chung et al. (1976), Sage et al. 
(1979, 1981c), Sage (1982), Alitalo et al. (1980), McCullagh et al. 
(1980), Roll et al. (1980), Fessler et al. (1981a), Sankey et al. (1981, 
Sankey and Barnes (1984), Borg et al. (1982), Tseng et al. (1982b), 
Morton and Barnes (1982, 1983), Bartholomew and Anderson (1983), 
Bellon et al. (1985), Keremyi et al. (1985), and Jerdan and Glaser 
(1986). 

Kidney: Roll et al. (1980), Martinez-Hernandez et al. (1982), Bar- 
tholomew and Anderson (1983), Foidart et al. (1983), and Bonadio et al. 
(1984). 

Lung: Madri and Furthmayr (1979, 1980), Haralson et al. (1980), 
Reiser and Last (1983), Konomi et al. (1984). 
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Liver and hepatocyte cell cultures: Biempica et al. (1980), Hatahara 
and Seyer (1982), Rojkind et al. (1983), Tseng et al. (1983), Murata et 
al. (1984), Yamamoto et al. (1984), Hata et al. (1985), and Schuppan et 
al. (1986). 

Pancreas: Ingber et al. (1985). 

Placenta and uterus: Burgeson et al. (1976), Sage and Bornstein 
(1979), Hong et al. (1979), Foidart et al. (1981), Rhodes and Miller 
(1981), Abedin et al. (1982), Madri et al. (1982), Mandell and Sodek 
(1982), Modesti et al. (1984), and Niyibizi et al. (1984). 

Amniocytes: Alitalo et al. (1980). 

Chinese hamster ovary cells: Limeback et al. (1982). 

Mammary gland: Warburton et al. (1983). 

Gingiva, oral mucosa, periodontal tissue and tooth development: 
Sodek and Mandell (1982), Sodek et al. (1982), Narayanan and Page 
(1983, 1985), Narayanan et al. (1983), Limeback et al. (1983), Silver et 
al. (1984), Kurita et al. (1985), Dabbous et al. (1981), and Schuppan et 
al. (1986). 

Tendon, skin, and bone: Chung et al. (1976), Jimenez et al. (1978), 
Laurain et al. (1980), Herrmann et al. (1980), Fessler et al. (1981b, 
1985a), Aubin eż al. (1982), Linsenmayer et al. (1983a), von der Mark 
and Ocalan (1982), Elstow and Weiss (1983), Kenney et al. (1983), 
Woodley et al. (1983), Weber et al. (1984), Konomi et al. (1984), Broek et 
al. (1985), and Smith et al. (1986). 

Cornea: Linsenmayer et al. (1985), Davison et al. (1979), Yue et al. 
(1979, 1983), Conrad et al. (1980), Pöschl and von der Mark (1980), 
Welsh et al. (1980), Kao and Foreman (1980), Kao et al. (1982), Tseng et 
al. (1981, 1982a), Sankey et al. (1981), Sankey and Barnes (1984), 
Newsome et al. (1982), von der Mark and Öcalan (1982), and Nakayasu 
et al. (1986). 

Keratinoctyes: Birkedal-Hansen et al. (1982) and Prunieras et al. 
(1983). 

Nervous system: Shellswell et al. (1979), Maxwell et al. (1984), and 
Salonen et al. (1985a). 

Smooth muscle and smooth-muscle cell cultures: Chung et al. (1976), 
Mayne et al. (1978), McCullagh et al. (1980), Fessler et al. (1981a), 
Bartholomew and Anderson (1983), Leushner and Haust (1984), Bellon 
et al. (1985), and Schuppan et al. (1986). 

Skeletal muscle and cell cultures: Duance et al. (1977, 1980), Bailey et 
al. (1979), Shellswell et al. (1980), John and Lawson (1980), Sasse et al. 
(1981), von der Mark and Ocalan (1982), Linsenmayer et al. (1983b), 
Sanes and Cheney (1982), Stephens et al. (1982), Salonen et al. (1985b), 
DeMichele et al. (1985), and Nusgens et al. (1986). 
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Tumors: 

Breast carcinoma: Barsky et al. (1982), Gordenne et al. (1982), and 
Van Cauwenberge et al. (1983). 

Osteosarcoma: Shapiro and Eyre (1982) and Yamagata and Yama- 
gata (1984). 

Neurofibroma: Peltonen et al. (1981, 1984) and Shinkai (1984). 

Rhabdomyosarcoma: Krieg et al. (1979), and Alitalo et al. (1982). 

Basal cell carcinoma: Van Cauwenberge et al. (1983). 

Mesothelial cells: Harvey and Amlot (1983). 

Dedifferentiated chondrocyte cultures: Benya et al. (1977, 1978) and 
Mayne et al. (1984a). 

Invertebrate (jellyfish): A mesoglial collagen was suggested to be like 
type V, (Miura and Kimura, 1985). 


X. CONCLUSIONS 


The presence of collagen V in the vicinity of basement membranes 
and in collagen fibers suggests that it can act as a linker and also 
contribute to fiber structure. This collagen has several a chains in 
different trimeric combinations. It is not known whether these various 
type V molecules have different physiological functions. The existence 
of the related 1a and 2a chains in cartilage (see Eyre and Wu, this 
volume) and of the a1'(V) chain in tendon suggests that there could be 
tissue-specific adaptation. Significant noncollagenous regions are 
maintained from the procollagens V and are only slowly cut off. These 
noncollagenous peptides must influence the interactions of the newly 
made p-collagens V with other components of the extracellular matrix. 
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I. INTRODUCTION 


A new protein now known to be a fragment of collagen VI (Engel et 
al., 1985; Rauterberg et al., 1986) was first described about 10 years 
ago. It soon became clear that collagen VI possesses several unique 
properties which distinguish this protein from any other type of col- 
lagen known so far. One unusual characteristic is the rather short 
triple-helical domain which accounts for less than half of the total 
mass of the protein. Another unique feature is the assembly of col- 
lagen VI monomers into well-defined oligomers which are the building 
blocks of microfibrils found in tissues and cell cultures and are now 
the best-characterized microfibrillar structures existing in the extra- 
cellular matrix (for a review on microfibrils, see Cleary and Gibson, 
1983). The present article will review the data which led to the under- 
standing of the structure and macromolecular organization of collagen 
VI and, in addition, the data related to its biosynthesis, immunology, 
and tissue distribution. 


Lines of Discovery 


Chung et al. (1976) were the first to report on the isolation of a large, 
disulfide-bonded collagen complex from a pepsin digest of the intimal 
layer of human blood vessels. This material had a unique amino acid 
composition, was rich in carbohydrate, and could be split into peptide 
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chains with a M, of about 50,000 by reduction. Similar or identical 
protein complexes were subsequently prepared from cirrhotic liver 
(Rojkind et al., 1979), human and bovine placenta (Furuto and Miller, 
1980; Risteli et al., 1980; Jander et al., 1981), calf skin (Laurain et al., 
1980), and uterus (Abedin et al., 1982). This material was originally 
referred to as high-molecular-weight aggregates, short chain collagen, 
or intima collagen. Peptide map and N-terminal amino acid sequence 
analyses (Jander et al., 1983) as well as electron microscopy (Furth- 
mayr et al., 1983) demonstrated no relationship of the protein to the 
collagens I-V and led to the proposal to name this new protein col- 
lagen type VI. 

It is now known that pepsin solubilization of collagen VI causes the 
loss of about 50% of its original mass. Larger and presumably intact 
forms of collagen VI chains were also studied at the same time, but 
their relationship to pepsin-solubilized collagen VI was not immediate- 
ly recognized. Sear et al. (1978, 1981a) had characterized a M, = 
150,000 polypeptide chain (termed MFP I), from nuchal ligament cell 
cultures, which was considered to be a subunit of the microfibrils 
associated with elastin. A similar, disulfide-bonded polypeptide 
(GP-140) with a M, of 140,000 was identified in the detergent-insolu- 
ble matrix of cultured skin and lung fibroblasts (Carter and Hako- 
mori, 1981; Carter, 1982a). Both polypeptides contained significant 
amounts of hydroxyproline and hydroxylysine, indicating that they 
originated from a collagenous protein. Glycoproteins of similar size 
and composition were also found in urea or guanidine—HC] extracts of 
muscle (Marton and Arnason, 1982) and aorta and ligamentum nuchae 
(Gibson and Cleary, 1982) and were, after reduction of disulfide bonds, 
obtained in purified form. A possible relationship between these col- 
lagen-like glycoproteins and pepsin-solubilized collagen VI was indi- 
cated because of certain similarities in compositional features (Gibson 
and Cleary, 1982; Cleary and Gibson, 1983). 

Convincing evidence for this relationship came both from immuno- 
logical studies of tissue and cell culture-derived glycoprotein using 
antibodies against either pepsin-solubilized collagen VI (Jander et al., 
1984; von der Mark et al., 1984; Hessle and Engvall, 1984; Trüeb and 
Bornstein, 1984) or the larger, reduced polypeptide chains (Heller- 
Harrison and Carter, 1984; Gibson and Cleary, 1985), and by studies of 
the degradation of the intact form with pepsin (Knight et al., 1984; 
Ayad et al., 1985). These findings were strongly supported by electron 
microscopy of rotary shadowed specimens of nondegraded collagen VI 
(von der Mark et al., 1984; Jander et al., 1984) which demonstrated the 
same monomer-oligomer structures previously observed in prepara- 
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tions of pepsin-solubilized collagen VI (Furthmayr et al., 1983). In the 
following sections we will refer to the material consisting of M, 
110,000--140,000 polypeptides as intact collagen VI. There are a few 
more reports on similarly sized polypeptides from lung tissue (Lafuma 
et al., 1982) and cultured fibroblasts (Lehto et al., 1981; Lehto, 1983), 
but the definite establishment as collagen VI structures or fibronectin 
receptors (see Section IID) is less clear. 


II. PROTEIN CHEMISTRY AND 
MACROMOLECULAR ORGANIZATION 


A. Sources of Collagen VI 


Pepsin digests prepared from various tissues in 0.1-0.5 M acetic or 
formic acid are still the most convenient source for obtaining collagen 
VI (although not the intact molecule). Most purification procedures 
then use differential precipitation with increasing NaCl concentra- 
tions to separate collagen VI from other collagens (Furuto and Miller, 
1980). Most of the collagen VI, precipitating between 1.2 and 1.8 M 
NaCl, is redissolved in 0.1 M acetic acid or 2 M guanidine HCl, and 
precipitated again by dialysis against 0.02 M Na,HPO,. The latter 
step removes efficiently the more soluble 7 S collagen which is a cross- 
linked fragment of collagen IV (Risteli et al., 1980; Abedin et al., 1982). 
Further purification by chromatography on carboxymethylcellulose 
(CM-cellulose) or a molecular sieve column may be necessary to obtain 
a more homogenous preparation of pepsin-solubilized collagen VI 
(Furuto and Miller, 1980; Jander et al., 1981; Odermatt et al., 1983). 

The complete purification of intact collagen VI has been only accom- 
plished with its constituent polypeptide chains (Gibson and Cleary, 
1982, 1985; Marton and Arnason, 1982; Carter, 1982a; Jander et al., 
1984; Knight et al., 1984). Here, tissues or cell cultures are extracted 
with 6 M guanidine HC] or 6-8 M urea in the absence or presence of a 
reducing agent, and the extracted proteins are subsequently reduced 
and/or alkylated. Further purification steps usually include ion ex- 
change and molecular sieve chromatography or preparative gel elec- 
trophoresis. Carbohydrate affinity chromatography on phenylboro- 
nate—agarose was also used with success (Gibson and Cleary, 1985). 
The purification of intact collagen VI in a nonreduced triple-helical 
form has been attempted several times (Carter, 1982a; Jander et al., 
1984; von der Mark et al., 1984; Knight et al., 1984; Gibson and Cleary, 
1985), but the final products were not pure as judged by electro- 
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phoresis. They were nevertheless useful in electron microscopic and 
biological studies. Electrophoretically pure material was recently ob- 
tained from bovine uterus after molecular sieve chromatography in 
SDS (Trüeb and Winterhalter, 1986). 


B. Collagen VI Monomers and Constituent Chains 


Intact collagen VI monomers appear in electron micrographs after 
rotary shadowing as dumbbell-shaped molecules with two globular 
domains of about equal size connected by a 105-nm-long triple-helical 
rod (Figs. 1 and 2). These monomers are rarely seen in tissue extracts 
(von der Mark et al., 1984) but are efficiently generated from oligo- 
mers by reduction of disulfide bonds under nondenaturing conditions 
(Jander et al., 1984). Similar structures have been produced from pep- 
sin-solubilized collagen VI by partial reduction (Furthmayr et al., 
1983), but these monomers appear to have lost almost completely one 
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Fic. 1. Models of collagen VI monomers, dimers, and tetramers and of a dimeric 
fragment COL produced by bacterial collagenase. The relative size of the globular do- 
mains in intact collagen VI are denoted by dotted circles. They are degraded by pepsin to 
smaller structures indicated by full lines. Oligomer formation allows the distinction 
between outer (a) and inner (b) globular domains. The triple-helical domain (represented 
by a straight or twisted full line) was found to be the same in intact and pepsin- 
solubilized collagen VI. The relative positions of cysteine residues essential for inter- 
molecular cross-linking are denoted by S. Dimensions between arrows are given in 
nanometers. 
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Fic. 2. Rotary-shadowing images of intact collagen VI (A) obtained from a NaCl 
extract of neurofibroma and of pepsin-solubilized collagen VI (B) obtained from human 
placenta. Selected monomers (M), dimers (D), open (oT) and closed (cT) tetramers and 
larger aggregates (A) are indicated for better identification. Bar, 100 nm. Reproduced 
with permission from Engel et al. (1985). The original studies were described by von der 
Mark et al. (1984) and Furthmayr et al. (1983). 
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globular domain and significant portions of the other (Fig. 3). The 
molecular mass of intact collagen VI monomers has not yet been estab- 
lished by ultracentrifugation, but estimates based on the size of its 
chains (see below) indicates a M, in the range 330,000—420,000. Val- 
ues of 340,000 and 380,000 have been reported based on domain analy- 
sis (Aumailley et al., 1985) and electrophoretic mobility (Gibson and 
Cleary, 1985) of partially reduced collagen VI monomers. Monomers 
obtained from pepsin-solubilized collagen VI when reduced show a M, 
of 170,000 by ultracentrifugation (Furthmayr et al., 1983). Together 
with data on the conformation of the protein and chain size (see be- 
low), it appears that collagen VI, like other collagens, is a triple- 
stranded molecule in which each constituent chain very likely spans 
the length of the entire molecule in a single direction. 

Total reduction of disulfide bonds in intact collagen VI followed by 
denaturation produces unfolded polypeptide chains which migrate as a 
single band or occasionally as a close doublet of bands in SDS elec- 
trophoresis. These polypeptides are now referred to as a(VI) chains 
and show, after purification, a unique amino acid composition (Table I) 
characterized by 15—30 residues per 1000 of 4-hydroxyproline, hydrox- 
ylysine, tyrosine, and half cystine, and by a glycine content distinctly 
less than 330 residues per 1000. Molecular mass estimates were made 
from electrophoresis runs and consistently showed M, values of 
110,000 (using collagenous proteins) and 140,000 (using globular pro- 
teins as standards) (Gibson and Cleary, 1982; Carter, 1982a; Knight et 
al., 1984; Jander et al., 1984; von der Mark et al., 1984). The presence 
of two or three distinct «(VI) chains in these materials can be antici- 
pated from studies with pepsin-solubilized collagen VI, but their sepa- 
ration has not yet been reported. The splitting of the M, 140,000 band 
observed in immunoblotting (Trüeb and Bornstein, 1984) was in- 
terpreted as indicating the separation of al(VI) and a3(VI) chains. 

More recent studies provided, however, convincing evidence that 
collagen VI from uterus consists of stoichiometric amounts of «1(VI) 
and «2(VI) chains with a M, of 140,000, and a distinctly larger «3(VI) 
chain with a M, of 200,000 (Trüeb and Winterhalter, 1986). These 
chains differ in amino acid composition and patterns of collagenase- 


Fic. 3. Molecular images of collagen VI monomers (A), half-open dimers (B), and 
fragment COL (C) as visualized by rotary shadowing. Representative particles in B and 
C are indicated by arrows. The materials were prepared from pepsin-solubilized collagen 
VI of human placenta either by partial reduction under nondenaturing conditions (A 
and B) or by treatment with bacterial collagenase (C) (Odermatt et al., 1983; Furthmayr 
et al., 1983). Bar, 100 nm. Reproduced with permission from Engel et al. (1985). 
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resistant peptides. Similar results were also obtained for collagen VI 
from human lung (H. von der Mark, unpublished). The large form of 
the a3(VI) chain may have escaped detection in previous studies due to 
it lability against proteolysis or to the failure of specific antibody 
reactions in immunoblotting. The presence of a M, 200,000 component 
has been, in fact, noted in earlier studies of intact collagen VI (Jander 
et al., 1984; Gibson and Cleary, 1985) but was not interpreted at this 
time to represent a constituent chain. 

More complex chain patterns with two to more than four elec- 
trophoretic bands with M, of 40,000—70,000 were observed after total 
reduction of pepsin-solubilized collagen VI (Furuto and Miller, 1980; 
Rojkind et al., 1979; Laurain et al., 1980; Abedin et al., 1982; Jander et 
al., 1981, 1983; Odermatt et al., 1983; Triieb and Bornstein, 1984). 
Several of the major chain fragments were purified, indicating the 
presence of a single acidic chain and two more basic chains in the 
molecule. These chain fragments were named SC1 (acidic) and SC2 
and SC3 (basic); these were equated with chain fragments originating 
from al(VI), a2(VI) and a3(VD, respectively (Jander et al., 1981, 
1983). Since the SC fragments can usually be separated by SDS elec- 
trophoresis, it indicates that the individual a(VI) chains are degraded 
to a different degree by pepsin. Further limited heterogeneity may 
arise by nonuniform cleavage within single chains (Furuto and Miller, 
1980; Odermatt et al., 1983) and can make the interpretation of chain 
patterns difficult. Ultracentrifugal and molecular sieve analyses indi- 
cated, however, that the major chain fragments have a molecular mass 
in the range 43,000-53,000 (Furuto and Miller, 1980; Odermatt et al., 
1983). Their composition is characterized by a strong increase in 
glycine, hydroxyproline, and hydroxylysine content (cf. Table I) which 
indicates, in agreement with electron microscopic data, extensive but 
not complete loss of globular domain segments. Reduction of pepsin- 
solubilized collagen VI produces a small amount (5-10%) of other pep- 
tides which are smaller than the major chain fragments and show a 
noncollagenous composition. Their origin from the globular domains 
or from other proteins is not clear. 

The complete degradation to the triple helical domain of collagen VI 
requires reduction of disulfide bonds under nondenaturing conditions 
followed by a second pepsin treatment (Jander et al., 1981; 1983; Oder- 
matt et al., 1983). This procedure produced some further shortening of 
the chain fragments which now appear more similar in elec- 
trophoresis. The data also indicated a M, of about 120,000 for the 
triple helix, a value compatible with its length of 105 nm observed by 
electron microscopy (Furthmayr et al., 1983; Rauterberg et al., 1986). 
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Gly 
al (VI )p: Val-Xaa-Xaa -Arg-Phe-Glu-Gly-Gln-Pro-, jateu -Gly-Pro 
a2(VI)p: Leu-Glu-Ile-Pro-Gly-Pro-Arg-Gly-Pro-Xaa-Gly-Tyr-Xaa-Gly 
a3(vI)p: Ser-Ala-Lle-Ala-Gly-Val-Ala-Gly-val-Gly 


Fic. 4. Amino-terminal sequences of homologous fragments obtained from three dif- 
ferent «(VI) chains of collagen VI which constitute the triple-helical domain (Jander et 
al., 1983). The fragments were prepared from pepsin digest of human placenta which 
were then partially reduced to allow a second, limited degradation by pepsin to obtain 
o(VDp chains. Xaa denotes unidentified residues. 


Three different chain fragments could be separated after denaturation 
of the triple-helical domain [hereafter referred to as al(VI)p, a2(VDp, 
and a3(VDp] which all showed a collagenous amino acid composition 
(Table I) but differed distinctly in their N-terminal amino acid se- 
quences (Fig. 4) and in peptide maps produced with cyanogen bromide 
or proteases (Jander et al., 1983). The nearly equal ratio between the 
three different a(VI)p chains was interpreted to indicate a hetero- 
trimer composition, ala2a3, for intact collagen VI (Jander et al., 
1983). Direct proof for this composition, i.e., by chain cross-linking 
experiments, is still lacking. 

Amino acid sequence data are so far only available for short N- 
terminal segments (see Fig. 4) of a(VDp chains from human and 
bovine placenta (Jander et al., 1983) and demonstrated the presence of 
Gly-X-Y repeats which are characteristic for the collagen triple helix. 
Four cyanogen bromide peptides (27-194 amino acid residues) have 
been isolated from the acidic chain of pepsin-solubilized collagen VI, 
and all were found to contain less than one-third glycine (Furuto and 
Miller, 1980). This could indicate the presence of short interruptions of 
the Gly-X-Y repeats in the triple helix of collagen VI similar to those 
found in collagens IV and IX. Sequence data for cDNA corresponding 
to mRNA of collagen VI are not yet available, but work is in progress 
in several laboratories. 

Physical properties of the triple helix were studied with pepsin- 
solubilized collagen VI which contains up to 60-80% collagenous se- 
quences. This material shows a strong negative optical rotation [(@)313 
= -1130°] (Furuto and Miller, 1981) and characteristic signals in its 
CD spectrum (8s99nm = — 43,000 and 8,00,m = 3700° - em?/dmol) (Oder- 
matt et al., 1983) which are comparable in their magnitude to those of 
other collagenous proteins. Monomers prepared from this fragment by 
reduction under nondenaturing conditions show a sharp melting pro- 
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file of the triple-helix with T,,, = 32-35°C (Jander et al., 1981; Furuto 
and Miller, 1981; Odermatt et al., 1983). These observations, together 
with electron microscopic measurements (Hofmann et al., 1984), dem- 
onstrated that collagen VI possesses a typical triple helix with a com- 
parable rigidity (persistence length) to those of interstitial collagens. 
Based on reduction data, Furuto and Miller (1981) proposed a different 
structure where only two constituent chains were involved in triple- 
helical folding while the third chain was associated in a random-coiled 
fashion. This model could not explain the high protease resistance 
found for pepsin-solubilized collagen VI and was not supported by 
subsequent studies (Odermatt et al., 1983; Jander et al., 1983). 

When compared to the triple-helical domain less is known about the 
structure of the globular domains of collagen VI. A M, of 150,000 was 
estimated for each globule based on electron microscopic comparison 
with other globular structures (Engel et al., 1985) which is in reason- 
able agreement with mass estimates based on collagenase-resistant 
fragments (Aumailley et al., 1985). The globular domains are very 
likely to be different as indicated by differences in pepsin resistance 
(see Fig. 3) and topological orientation in collagen VI dimers (inner 
and outer globules) (Section II,C). It is also very likely that these 
globular domains are folded from either C- or N-terminal segments 
within the a(VI) chains, but this assignment has not yet been made. 
Controversial data exist about the size and number of noncollagenous 
segments resistant to bacterial collagenase which apparently con- 
stitute the globular domains of intact collagen VI. Studies with fibro- 
blast-derived material demonstrated either several bands with a M, of 
30,000—50,000 (Sear et al., 1981a,b; Aumailley et al., 1985), a particu- 
larly large fragment of M, about 120,000 (Trüeb and Bornstein, 1984), 
or failure to detect large fragments (Heller-Harrison and Carter, 
1984). The latter two results are not compatible with the general struc- 
tural model and may be caused by either incomplete cleavage due to 
incomplete reduction (Aumailley et al., 1985) or the failure to detect 
such fragments by immunoblotting. 

Three different fragments were consistently found in collagenase 
digests of reduced a(VI) chain (M, = 140,000 species) prepared from 
tissues with M,s of about 35,000, 55,000, and 70,000 (Marton and 
Arnason, 1982; Jander et al., 1984; Gibson and Cleary, 1985). These 
fragments were separated and found to lack hydroxyproline and to 
differ in amino acid composition and cyanogen bromide patterns (Gib- 
son and Cleary, 1985). It is not clear whether they represent homoge- 
neous peptide segments since some six different peptides would be 
expected to arise after collagenase cleavage of the three a(VI) chains. 
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They appear to originate from al(VI) and a2(VD chains, while a M, 
160,000 fragment has been reported for the a3(VD chain (Trüeb and 
Winterhalter, 1986). A heterogeneous mixture of noncollagenous pep- 
tides was also found after collagenase cleavage of pepsin-solubilized 
collagen VI (Odermatt et al., 1983) which were all smaller than those 
obtained from the intact protein chains. 

Collagen VI is characterized by an unusually high number (20-30) 
of cysteine residues which may be present in each a(VI) chain (Table 
D. Only 2-3 of these residues are presumably utilized to form inter- 
molecular disulfide bonds to stabilize the oligomer forms (Section 
II,C). Most are, therefore, very likely used for intramolecular or intra- 
chain bonds or exist with free sulfhydryls. The presence of intra- 
molecular bonds is indicated by the high conformational stability of 
collagen VI against denaturing agents as revealed by electron micros- 
copy (Jander et al., 1984; von der Mark et al., 1984) and by protease 
resistance (see below). Intramolecular bonds may be crucially involved 
in the stabilization of globular domains since they disappear after 
complete reduction under nondenaturing conditions of disulfide bonds 
(Furthmayr et al., 1983). Intrachain disulfide bonds very likely exist at 
least in the triple-helical domain since a(VI)p chains contain 3-6 cys- 
teine residues (Jander et al., 1983; see Table I). These residues are not 
regularly distributed as indicated by cleavage with 2-nitro-5-thio- 
cyanobenzoic acid (Triieb and Bornstein, 1984). 

The high degree of disulfide-bonding found in collagen VI endows 
the protein with a high resistance to proteolytic attack. When extract- 
ed from tissues in intact form with denaturing agents but without 
reduction, collagen VI resists digestion by bacterial collagenase but 
not digestion by pepsin. Reduction under nondenaturing or denaturing 
conditions resulted in a considerable increase in protease sensitivity 
(Gibson and Cleary, 1982, 1985; Jander et al., 1984). Collagen VI iso- 
lated from cell cultures of both medium and matrix forms was also 
stable against neutral proteases such as trypsin and thrombin and was 
a poor substrate for collagenase (Carter and Hakomori, 1981; Carter, 
1982b; Heller-Harrison and Carter, 1984; Bruns, 1984; Aumailley et 
al., 1985). A distinct cleavage by collagenase was, however, reported 
for nuchal ligament collagen VI (Sear et al., 1981a; Knight et al., 
1984). Protease stability is also a characteristic feature of pepsin-sol- 
ubilized collagen VI which, after isolation, cannot be cleaved further 
by additional exposure to pepsin, various neutral proteases, or bacte- 
rial collagenase (Furuto and Miller, 1981; Odermatt et al., 1983). Here 
again, reduction of disulfide bonds increases protease sensitivity (see 
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above). No reports exist so far on the cleavage of collagen VI by known 
tissue collagenases which are able to split a single peptide bond in 
either collagens I-III, IV, or V. 

Collagen VI is also subjected to several other posttranslational modi- 
fications besides disulfide bond formation, such as hydroxylation of 
30-50% of its proline residues within the triple-helical segment to 
form 4-hydroxyproline and of more than 70—90% of its lysine residues 
to form hydroxylysine (see Table I). More than 90% of the hydrox- 
ylysine residues are then substituted with glucosyl-galactosyl or, toa 
lesser degree, with galactosyl groups (Chung et al., 1976; Furuto and 
Miller, 1981; Jander et al., 1981, 1983). These studies also demon- 
strated substantial amounts of glucosamine but no galactosamine in 
pepsin-solubilized collagen VI, indicating the presence of N-linked 
oligosaccharide even within the triple-helical segment of the protein. 
From the reported data it can be calculated that pepsin-solubilized 
collagen VI contains 20-25% carbohydrate. Lower values, however, 
have been reported for intact a(VI) chains isolated from fibroblasts 
(2.7%; Carter, 1982a) or from nuchal ligament (6.1%; Knight et al., 
1984). The latter study demonstrated the presence of galactose, 
glucose, mannose, glucosamine, and fucose in a ratio of 33:24:25:13:5 
but only trace amounts of sialic acids. As expected from these data, 
a(VI) chains usually show strong staining with the periodic acid- 
Schiff reagent and also react with lectins including concanavalin A, 
wheat germ agglutinin and ricin (Carter, 1982b; Gibson and Cleary, 
1982; Marton and Arnason, 1982). These reactions have been fre- 
quently used to monitor purification and fragmentation of collagen VI. 
Carbohydrate incorporation was also demonstrated in biosynthetic 
and cell surface labeling studies (see Section II). 


C. Oligomeric Forms of Collagen VI 


The most unique feature of collagen VI is its assembly into well- 
defined dimers and tetramers which, in varying proportions, are the 
major particles in preparations of pepsin-solubilized collagen VI from 
human placenta visualized by rotary shadowing and negative staining 
(Fig. 2). Molecular masses and sedimentation coefficients calculated 
for models of these oligomers (Fig. 1) are in good agreement with 
ultracentrifugal data demonstrating that the dimers and tetramers 
seen by electron microscopy are indeed the most prominent compo- 
nents in these preparations (Furthmayr et al., 1983). Identical oligo- 
mers have been observed in subsequent studies using pepsin-sol- 
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ubilized collagen VI from a variety of sources (Jander et al., 1984; 
Hessle and Engvall, 1984; Fleischmajer et al., 1985; Schuppan et al., 
1985; Linsenmayer et al., 1986). 

Dimers of collagen VI are formed by a staggered, antiparallel align- 
ment of monomers with a 75-nm overlap in their triple-helical domains 
(Fig. 1). This arrangement gives rise to 30-nm-long single triple-helical 
segments at each end of the dimer and allows the distinction between 
outer and inner globular domains differing in size due to a different 
susceptibility to pepsin. Other structural features as observed in nega- 
tively stained dimers (Furthmayr et al., 1983) include a supertwisting 
(four to five turns) within the overlapping triple-helical segments and 
an egg-shaped contour of the inner globules pointing toward one side of 
the particles. Strong evidence for this antiparallel, staggered assem- 
bly came from partial reduction experiments (Odermatt et al., 1983; 
Furthmayr et al., 1983) which, after opening of a single disulfide bond, 
resulted in the appearance of asymmetric, half-open dimers (Fig. 3) as 
well as more recently from an asymmetric distribution of monoclonal 
antibodies along the inner triple helix of dimers (Linsenmayer et al., 
1986) as demonstrated by rotary shadowing (see Fig. 10). 

Tetramers of collagen VI are formed by the lateral association of 
dimers with their ends in register (Fig. 1). Stabilization of this assem- 
bly apparently occurs by a crossing-over of the outer triple-helical 
segments into a scissors-like structure at one or both ends producing 
both open and closed tetramers (Fig. 2). The frequent separation of the 
two inner triple helical domains indicates little interaction in this 
region of the tetramer. Major contributions to the stabilization of the 
tetramer (and dimer) structures are provided by 3-4 disulfide bonds 
located in different portions of the monomers (Fig. 1). Two disulfide 
bonds connect the inner globular domains to the opposite triple-helical 
domains in the dimers, and further disulfide bonds may be involved in 
stabilizing the scissors-like connections in the tetramers (Odermatt et 
al., 1983; Furthmayr et al., 1983). Noncovalent interactions between 
central regions of tetramers and dimers (supercoiled triple helix) may 
also exist, but they are apparently weak and lost after partial reduction. 

The basic features of pepsin-solubilized collagen VI oligomers were 
confirmed in electron microscopic studies of intact collagen VI ob- 
tained from neutral salt extracts of neurofibroma (von der Mark et al., 
1984) and guanidine—HCl extracts of nuchal ligament (Jander et al., 
1984). The materials consisted either mainly of tetramers together 
with some dimers and monomers (Fig. 2) or, in the latter case, mainly 
of dimers mixed with some open tetramers and end-to-end linked di- 
mers. These structures possessed the 105-nm-long triple-helical do- 
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main observed in pepsin-solubilized collagen VI plus two larger 
globular domains which, due to their size, often obscure the scissors- 
like connection at the ends of tetramers. Similar structures were re- 
cently found for collagen VI isolated from cell cultures with mono- 
clonal antibodies (Engvall et al., 1986). These intact oligomers are also 
held together by disulfide bonds as shown by the release of monomers 
after reduction under nondenaturing condtions (Jander et al., 1984). 
The number of disulfide bonds involved is not known and could be 
larger than those determined by partial reduction of pepsin-solubilized 
collagen VI (Odermatt et al., 1983). 

Oligomer formation increases both thermal stability and protease 
resistance of collagen VI. Pepsin-solubilized collagen VI showed an 
increased thermal stability compared to collagen VI monomers with 
either a clear biphasic melting profile with Tn of 42 and 62°C (Jander 
et al., 1981; Odermatt et al., 1983) or a more complex profile within the 
same temperature range (Furuto and Miller, 1981). This thermal sta- 
bility could be due to triple-helical contacts (superhelix) and/or sta- 
bilizing effects by disulfide bonds. The biphasic melting indicates dif- 
ferences in stability within a single triple-helical domain which was 
supported by isolating a defined fragment Col after treatment of heat- 
denatured (55°C), pepsin-solubilized collagen VI with bacterial col- 
lagenase (Odermatt et al., 1983; Furthmayr et al., 1983). This frag- 
ment (M, of 240,000) corresponded to the inner triple-helical domain of 
collagen VI dimers (Figs. 1 and 3) and had a T,, of 42°C. Quite often, it 
showed an unraveling of the two supercoiled triple-helical segments 
which, however, always remained connected at both ends due to di- 
sulfide bonds. Preparations of fragment Col did not contain any 
tetrameric structures or larger aggregates, demonstrating that the 
outer segments in the dimers are exclusively responsible for connect- 
ing tetramers and microfibrils. 

The data on the structure and degradation of oligomers are highly 
suggestive for a single pathway in the assembly process which very 
likely starts with the formation of staggered dimers followed by open 
and closed forms of tetramers. Some support for this suggestion is 
obtained from biosynthetic studies (Section III), but more data are still 
needed for a full understanding of these processes. 


D. Microfibrillar Assembly 


The tissue organization of collagen VI into structures larger than 
the oligomers described above was suggested by studies which failed to 
extract substantial amounts of the protein from muscle or placenta 
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Fic. 6. Comparison of molecular models of linear aggregates of collagen VI tetramers 
(a), microfibrils (b), and aggregated microfibrils (c) observed in cell cultures. The lines 
and circles in the tetramer aggregates (a) correspond to the triple-helical and globular 
domains of collagen VI monomers, respectively, and the outer globular domains are 
marked by a central dot (Furthmayr et al., 1983). Corresponding electron micrographs 
are shown in Fig. 5. The lines in the microfibril models (Bruns, 1984) represent the inner 
triple-helical domains of tetramers, while the two tapered beads represent, in an un- 
defined fashion, eight globular domains originating from two overlapping tetramers. 
Electron micrographs of single and aggregated microfibrils are shown in Fig. 7. The 
precise orientation of the globular domains within the beaded regions is unknown, and 
several models have been discussed (Rauterberg et al., 1986). Numbers between arrows 
refer to distances in nanometers. 


(Marton and Arnason, 1982; von der Mark et al., 1984) and cell ma- 
trices (Carter, 1982b) by denaturating solvents (6 M guanidine HCl, 8 
M urea) in the absence of reducing agents. Larger yields were reported 
for nuchal ligament (Gibson and Cleary, 1982), indicating some tissue 
variations. Direct evidence for larger collagen VI structures were ini- 
tially obtained for some material present in pepsin-solubilized col- 
lagen VI (Furthmayr et al., 1983) which consisted of up to six tetramers 
linked to each other in end-to-end fashion (Fig. 5). The detailed elec- 
tron microscopic analysis of these short filaments indicated an overlap 
of the outer triple-helical segments of two tetramers, shortening their 
distances between opposing inner globular domains from about 60 nm 
to either 42 nm (after negative staining) or 30 nm (after rotary shadow- 
ing). This arrangement suggested close contacts between outer globular 
domains of one tetramer with the inner globular domains of an adjacent 
tetramer (which may become disulfide-bonded) and a lateral repeat of 
this structure with a distance of 105-117 nm (Fig. 6). Such linear 


Fic. 5. Electron microscopic visualization of linear aggregates of tetramers present in 
pepsin-solubilized collagen VJ from human placenta by negative staining (a and b) or 
rotary shadowing (c). Bars, 100 nm. Reprinted by permission from Biochem. J. 211, 303-— 
311 copyright © 1983 The Biochemical Society, London. 
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aggregates of tetramers have not been observed in guanidine-HCl 
extracts of tissues which may contain, however, small amounts of 
dimers in linear association (Jander et al., 1984). Such structures may 
have arisen from filaments in which the tetramers were imperfectly 
cross-linked in their scissors-like region. 

Based on the model shown in Fig. 6, it was predicted that collagen VI 
could be a component of tissue microfibrils with a diameter of 5-10 nm 
anda periodicity of about 100 nm (Furthmayr et al., 1983). This predic- 
tion gained some support from immunoelectron microscopy of tissues 
(von der Mark et al., 1984; see Section V) and by electron microscopy of 
certain kinds of fibrils found in cornea, tendon, and skin fibroblast 
cultures (Bruns, 1984). The latter study identified negatively stained 
beaded microfibrils of 3 nm width with pairs of beads repeating at 100- 
nm intervals up to a length of 2 pm. A second structure consisted of 
bundles of laterally aligned microfibrils with all their beads in regis- 
ter (Fig. 7). The beads showed a tapered or rectangular shape and were 
often separated by a small central cleft. It was recognized by Bruns 
(1984) that the periodicity and dimensions of these fibrils closely re- 
sembled those of a linear array of collagen VI tetramers (Fig. 6). Their 
morphology was clearly different from those of collagen I precipitated 
in fibrous long spacing (FLS) form (periodicity 220-240 nm) and from 
microfibrils associated with elastin. That these fibroblast microfibrils 
contain, in fact, collagen VI was demonstrated by immunoelectron 
microscopy (Bruns et al., 1986; see Fig. 8) and by isolating a similar 
fibril with monoclonal antibodies (Engvall et al., 1986). 

Microfibrils and cross-banded lateral aggregates essentially identi- 
cal to those described by Bruns had been observed in previous studies 
with fibroblast cultures (Goldberg and Green, 1964; Yardley and 
Brown, 1965; Green and Meuth, 1974; Sear et al., 1978). Morphologi- 
cally similar cross-banded fibrils were also frequently observed in 
normal and pathological tissues (Table ID and first described to be 
present in Descemet’s membrane (Jakus, 1956), in the trabecular mesh- 
work of the eye (Garron et al., 1958), and in several nerve tumors (Luse, 
1960). These fibers show, after positive staining, a periodic arrange- 
ment of dark and light bands (Fig. 9) with a lateral repeat of 80-120 nm 
and vary in diameter from several hundred nanometers to a few mi- 
crometers. The electron-lucent zones of tissue fibers very often show the 
presence of fine filaments connecting the dark band zones, thus corre- 
sponding even in fine details to the collagen VI fibers in cell cultures 
(Fig. 7). These tissue structures were referred to as Luse bodies, 
fusiform bodies, and spiney or zebra collagen and were usually found 
close to collagen I fibrils, basement membranes, or cells. Their abun- 
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Fıc. 7. Beaded filaments of collagen VI from chicken embryo tendon fibroblast 
culture medium. (a) Negatively stained beaded microfibril of 3-nm width with pairs of 
beads (arrows) repeating at 100-nm intervals. The elongated and tapered beads are 
about 18 nm in length, and their center-to-center distance is 25 nm. Bar, 100 nm. (b) 
Thin section of the fibroblast culture (embedded in Epon, stained with OsO,, Mg acetate, 
and lead citrate) showing bundles of beaded filaments. Because of positive staining, the 
pair of beads appearing light in (a) are imaged as dark bands here. The 100-nm repeat is 
clearly apparent and gap regions (arrows) are also occasionally seen. Bar, 500 nm. The 
photographs were kindly supplied by Dr. R. Bruns. The original work was described by 
Bruns (1984). Reproduced with permission. 


dance can be increased by treating cell matrices with ATP or by incubat- 
ing tissues under culture conditions (Kajikawa et al., 1980; Bruns et al., 
1986). Such fibers were occasionally considered to represent FLS col- 
lagen (Ramsey, 1965; Banfield et al., 1973; Kajikawa et al., 1980), 
which is presumably a misinterpretation since the periodicity of the 
tissue fibers is only half of that found for collagen I precipitated in 
FLS form (Hashimoto and Ohyama, 1974). The distinct staining ofthe 
cross-banded structures with ruthenium red (Buckwalter et al., 1979; 
Kajikawa et al., 1980) may indicate the presence of proteoglycan in the 
fiber complex. 
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Fic. 8. Immunolabeling of collagen VI microfibrils present in cell culture matrix. (a) 
Thin section of a culture of unfixed human foreskin fibroblasts treated with monoclonal 
antibody to collagen VI and with secondary goat anti-mouse antibody conjugated with 
colloidal gold particles 10.7 nm in diameter. (b) Thin section of a culture fixed with 
glutaraldehyde and incubated with rabbit antiserum against collagen VI and with sec- 
ondary goat anti-rabbit antibody coupled to ferritin. Note in a and b the clusters of 
immunolabel with a periodicity of 100 nm. (c) Thin section, as in a, showing gold parti- 
cles labeling the filamentous interbands of the bundles of beaded filaments indicating 
that this monoclonal antibody is directed to the triple-helical region of collagen VI. (d) 
Thin section of a human skin fibroblast culture that was incubated with anti-GP-250 
antibody and with secondary antibody conjugated with colloidal gold. Labeling of the 
dark bands instead of the filamentous interbands is observed in this case. The pho- 
tographs were kindly supplied by Dr. R. Bruns. The original studies were described by 
Bruns et al. (1986). Reproduced from The Journal of Cell Biology, 1986, 103, 393—404 by 
copyright permission of The Rockefeller University Press. 
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Fic. 9. Cross-banded structures which are possibly related to collagen VI aggregates 
in human nucleus pulposus (a) and neurofibroma skin (b). These structures show a 
periodicity of 100-110 nm and are less abundant in a (arrows) than in b. Bars, 500 nm. 
The electron micrographs were obtained after staining by lead citrate and uranyl ace- 
tate and kindly provided by Dr. Raul Fleischmajer and Mrs. Hanna Wiedemann. 


Biochemical and immunohistological evidence for the presence of 
collagen VI in these cross-banded tissue fibers is still scarce. Ludowieg 
and Parker (1973) solubilized such fibers from the nucleus pulposus 
and found, after reconstitution by dialysis, that they contain signifi- 
cant amounts of hydroxyproline and glycosaminoglycans. Other stud- 


(L61) 391947849 
(E96T) "70 32 BIəqIƏq[1S 
(E961) TOM pUL uueumeN 


(9L6T) (7? 12 yrenbaeyy 
(PL6T) eweÄyg pue ozowryseH 


(TL6T) eduopy pue oJfoW 
(896T) POOMY207] PUL 0PJoIABıYy 
(S961) Aoswey 

(Z961) 72 72 ıpuowrey 


(0961) as] 
(£961) 

usyoy “(8G6T) "yD qa uollen) 

(F961) Id 

(EL61) ‘7? 72 pjoyueg 

(LL6T) ‘7? 12 La[suer-yayosjoulaNy 

(SLOT) SPIEMPY 

(O86T) ‘72 72 emee 

(L961) U2T[Y pue so4e} 

(6L61) ‘72 72 aayfemspng 


(OL6T) 72 72 YeUIOD ‘(896T) 
TUISSBIEILT-TUJEIOS pue YIIWS 


(G861) epemes 


sısAäydodAyouape ey 
adej1]ıe) ITem ƏSNON 
ureaq ey 


EWLOIIBSO9ISO ISNOW 

BWIOUTOIR [Jao snowenbg 
sıow 

-n} apou ydw; veung 


AOUMF SAIOU 
ILISNOIe "BWOUUEMUIS 
aso ueumy 
Jo yIomysauı 1e[Noaqeay, 
yer ‘BATOU TEIgLL 
umıpıesoAw ueung 
UOPUu9} ‘uIys ueung 
urys Asoide] uewny 
ulys yea psanyınd 
sayeuasouloy Ulys dd 


(ggeı ‘ueuny) 
snsodjnd snepnn 

suorpəs [equesuey 

ul suseqjed [euodexay 
‘BWO1]S [BAUIO Ul ITEY 


spueq 
611-08 yaep [euorzippe Aq 
021 popeasiq are sauoz 4431] 


LOT ‘(wu QT) pueq yıep UL 


00188 
G21-58 


001 
081 
LIT-001 
LOT 


081 


001 
Vol 
001704 
06 
081 
0817-06 
08 
98 


001 
spueq 44311 944 
ul s4uəwejiy aul ‘azıs 
021 renba moqe jo spueq 


(Z96T ‘9S6T) snyerf dUBIqWIEU szw OTI-LOT 1431] pue yep Surpeusoy[y 
savua y saınea] (wu) addy, 
[ereds ‘“uoryezipesor] AyDIpoLIag 


STAIMOYIIAN [A NYOVTIOT JO SILYOHUIOYV 


"IVAALV’] ADOTOHA4YOM NI DONFIIWASAY SANSSIL NI GHANASAO SUAEIY AIANVG-SSOUI 


H dav 


TYPE VICOLLAGEN 127 


ies (H. von der Mark, H. Wiedemann, K. Remberger, and R. Timpl, 
unpublished results, 1986) demonstrated the presence of some de- 
graded collagen VI in such reconstituted material which was sup- 
ported by immunogold staining with specific antibodies. Definitive 
proof that collagen VI really generates such cross-banded fibers is, 
however, still lacking. Other cross-banded structures may have no 
relationship to collagen VI at all. This appears likely for Descemet’s 
membrane, which in tangential sections shows a characteristic hex- 
agonal lattice of dumbbell-shaped elements (Jakus, 1956, 1962). These 
structures are sensitive against collagenase (Sawada, 1982), and their 
potential precursors observed in cell culture are larger than collagen 
VI (Sawada et al., 1984). Still another form of fibril found in a few 
tissues (second section in Table II) shows a periodicity of 80-120 nm 
but differs distinctly in segmentation from the collagen VI fibrils pres- 
ent in cell cultures (Bruns, 1984). 

The reconstitution of microfibrils from soluble, purified collagen VI 
monomers or oligomers has so far not been reported. Dialysis of pep- 
sin-solubilized collagen VI against ATP at acidic pH produces fibrillar 
aggregates with a periodicity of 80 nm and a complex symmetric cross- 
striation pattern (Jander et al., 1981). A different fibril pattern, but no 
SLS segments, was observed after precipitating the triple-helical do- 
main of collagen VI with ATP. The existence of such structures in 
tissues is doubtful. The formation of cross-banded fibrils by dialyzing 
pepsin-solubilized collagen VI against neutral buffers has been re- 
ported (Rauterberg et al., 1986), but their relationship to tissue struc- 
tures remains to be established. 


III. BIOSYNTHESIS 


Biosynthesis of collagen VI has been amply documented in cultured 
fibroblasts obtained from adult and embryonic skin (Carter and 
Hakomori, 1981; Trüeb and Bornstein, 1984; von der Mark et al., 1984; 
Aumailley eż al., 1985), lung (Carter, 1982a,b; Engvall et al., 1986), 
nuchal ligament (Sear et al., 1978, 1981a; Gibson and Cleary, 1985), 
tendon (Taylor eż al., 1982), and dental pulp (Shuttleworth et al., 1982) 
as well as for aortic smooth muscle cells (von der Mark et al., 1984; 
Gibson and Cleary, 1985) and a myoblast cell line (Engvall et al., 
1986). Endothelial and epithelial cells and various tumor cell lines 
were found to produce little or no collagen VI and to show a defect in 
matrix deposition (Carter, 1982b, 1984; Hessle and Engvall, 1984; von 
der Mark et al., 1984). Similar observations were reported for SV40- 
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transformed fibroblasts (Carter, 1982b) and attributed to a more than 
20-fold reduction in the amount of collagen VI mRNA (Trüeb et al., 
1985). However, a few tumor cell lines, such as osteosarcoma and 
leiomyosarcoma, may still produce distinct amounts of collagen VI 
(Engvall et al., 1986). Production of collagen VI was monitored by 
labeling with radioactive amino acids or by using radioactive fucose 
(Sear et al., 1978, 1981a), glucosamine, or surface labeling of galactose 
residues (Carter and Hakomori, 1981). Collagen VI chains were then 
identified either by their characteristic electrophoretic mobility or, 
more recently, by antibody reactions using immunoprecipitation (Sear 
et al., 1981a,b; Trüeb et al., 1985; Ayad et al., 1985), binding to mono- 
clonal antibodies (Hessle and Engvall, 1984; Engvall et al., 1986), or 
immunoblotting (Carter, 1982b; Triieb and Bornstein, 1984; Aumailley 
et al., 1985). 

All these data allowed a rather broad but still incomplete insight 
into the biosynthesis and matrix deposition of collagen VI. As expected 
for a collagenous protein, about 25-36% of proline and lysine in the 
a(VI) chains are subjected to hydroxylation as shown by radioactive 
labeling (Sear et al., 1981a) and chemical analysis of purified material 
(Carter, 1982a; see Table I). Extensive posttranslational modifications 
were also demonstrated by a reduction in size of a(VI) chains from a 
M, of 140,000 to 125,000 by culturing cells in the presence of 
tunicamycin and a,a-dipyridyl and in cell-free translation (Trüeb et 
al., 1985). The effect of both drugs was additive, which indicated a 
blockade of incorporation of both N-linked and hydroxylysine-linked 
carbohydrates. Treatment with a,a-dipyridyl also interfered with tri- 
ple-helix formation (due to under-hydroxylation), as judged by reac- 
tion with conformation-specific antibodies, but did not interfere with 
disulfide-bonding of a(VI) chains (Engvall et al., 1986). Some (Sear et 
al., 1981a; Engvall et al., 1986) but not all studies (Carter, 1984; 
Leushner and Haust, 1986) found reduced secretion and matrix deposi- 
tion of collagen VI in the absence of ascorbic acid. 

Collagen VI shows a greater deposition in the cell matrix (50%) 
produced by cultured cells, in contrast to collagens I and III which, in 
large part, are lost to the medium (Carter, 1984). The remaining col- 
lagen VI is about equally distributed between intracellular compart- 
ments and the culture medium. After labeling, collagen VI appears as 
a major cell matrix component together with GP-250 (see below) and 
some fibronectin and procollagens I and III (Sear et al., 1978; Carter 
and Hakomori, 1981). Pulse—chase studies demonstrated that the in- 
tracellular form, but not the medium form, of collagen VI is the pre- 
cursor of the matrix form and that cell contact with the matrix is 
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essential for deposition (Carter, 1984). Deposition is accompanied by 
increased disulfide bonding which results in collagen VI becoming 
insoluble in urea and detergents (Carter, 1982a). These structures ap- 
pear fibrillar by immunofluorescence and electron microscopy while 
intracellular collagen VIshows a granular staining pattern around the 
nucleus (Carter, 1984; von der Mark et al., 1984; Bruns, 1984; Bruns et 
al., 1986; Engvall et al., 1986). Synthesis and deposition of collagen VI 
usually starts soon after attachment of cultured cells but may be pre- 
ceeded by the deposition of fibronectin (Carter, 1982a, 1984; Bruns et 
al., 1986). 

The electrophoretic comparison of a(VI) chains present within cells, 
in cell culture medium, and in the cell matrix showed a single or a 
doublet band with a M, of 140,000 (globular standards), indicating 
only minor differences in size which may be entirely accounted for by 
differences in posttranslational modifications (Carter, 1984; 
Aumailley et al., 1985; Engvall et al., 1986). A similar size of «(VD 
chains was also observed by comparing extracts obtained from cell 
cultures and tissues (von der Mark et al., 1984; Heller-Harrison and 
Carter, 1984; Gibson and Cleary, 1985). The data, therefore, indicate 
the lack of a precursor form (procollagen) of collagen VI or only minor 
processing of a putative precursor involving the removal of a segment 
with a M, less than 10,000, at least for the M, 140,000 a(VI) chains. A 
single study (Trüeb and Bornstein, 1984) is at variance with the other 
data by claiming a M, of 180,000--190,000 for tissue a(V]) chains anda 
20% increase in mass for culture-derived «(VD chains. Conceivably, 
these values were overestimated, since a different size (M, = 140,000) 
was reported in subsequent studies (Trüeb et al., 1985; Crawford et al., 
1985). 

Disulfide bonding of newly synthesized a(VI) chains is presumably a 
fast process as shown for intracellular material isolated with mono- 
clonal antibodies (Engvall et al., 1986). Appropriate pulse—chase stud- 
ies for determining kinetic parameters of chain synthesis and assem- 
bly still remain to be done. The electron microscopic analysis of 
antibody-isolated material indicated abundant amounts of globules 
together with some monomers, aligned and staggered dimers, and 
tetramers within intracellular compartments (Engvall et al., 1986). 
The interpretation of these data is still difficult and will require the 
identification of the nature of the globules and better evidence for the 
presence of aligned dimers. Collagen VI in the culture medium is 
already highly disulfide linked to tetramers as shown by gradient 
centrifugation (Aumailley et al., 1985) and electron microscopy (Eng- 
vall et al., 1986). The latter study also demonstrated mainly tetramers 
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and linear aggregates of tetramers in urea extracts of the cell matrix. 
This material may represent only a small portion of the deposited 
collagen VI as indicated by the studies discussed above. The data taken 
together, however, are compatible with an assembly process proceed- 
ing from monomers through dimers and tetramers which, in the extra- 
cellular space, are connected into microfibrils. Evidence exists for 
some free a(VI) chains in cell culture medium (Carter, 1984; 
Aumailley et al., 1985), but these may arise as assembly artifacts. 

The amounts of collagen VI produced in cell culture relative to those 
of other extracellular matrix proteins are unknown. Radioimmuno- 
assays demonstrated concentrations of collagen VI in fibroblast medi- 
um in the range of 100-200 ng/ml (Schuppan et al., 1985). These 
values are 10 to 100-fold lower than those obtained for procollagens I 
and III and agree with the data which estimate the concentration of 
collagen VI in tissues to be low (Section V). They indicate that collagen 
VI is, in most cases, a minor component of the extracellular matrix. 

The most surprising observation from cell culture studies was the 
apparently intimate association of «(VD) chains with another polypep- 
tide chain. This polypeptide, which we will refer to as GP-250 (Carter 
and Hakomori, 1981), has in reduced form an apparent M, of 240,000- 
270,000 and was shown to be distinct from fibronectin. It was found to 
(1) be resistant to bacterial collagenase, (2) be disulfide bonded to 
larger complexes, and (3) remain associated with collagen VI during 
chromatography and immunoprecipitation (Sear et al., 1981a; Carter, 
1982b). The existence of complexes between GP-250 and collagen VI 
was then convincingly demonstrated by immunoprecipitation with af- 
finity-purified antibodies against a(VI) chains and with monoclonal 
antibodies (Trüeb et al., 1985; Engvall et al., 1986). These complexes 
persisted during reduction under nondenaturing but not denaturing 
conditions and were also formed with under-hydroxylated collagen VI 
(Engvall et al., 1986). GP-250 was also found to behave as a glycopro- 
tein but not as a proteoglycan in CsCl] gradient centrifugation (Shut- 
tleworth et al., 1982). A component with a M, of 260,000 was recently 
reported to be the precursor form of the «3(V]) chain (Trüeb and Win- 
terhalter, 1986). Even though it seems very likely that this precursor 
chain is identical with GP-250, it still remains to be proven experi- 
mentally. 

The nature of the collagen VI chains was also recently studied by 
using various cDNA clones isolated from a human express library 
(Chu et al., 1987). These probes reacted in Northern hybridization with 
three different mRNA species which, by matching peptide sequences, 
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could be identified as encoding the a1(VI) (4.2 kb), a2(VI) (3.5 kb), and 
a3(VI) chain (8.5 kb). The sizes of the mRNAs are therefore, in good 
agreement with those determined at the peptide level (see above). 

Among the M, 140,000 components characterized from fibroblast 
cultures are also polypeptide chains now considered to constitute the 
fibronectin receptor (Pytela et al., 1985; Horwitz et al., 1985; Akiyama 
et al., 1986; Patel and Lodish, 1986). These receptor chains differ from 
collagen VI by their solubility in detergents, lack of disulfide bonding, 
and lack of reaction with concanavalin A. Lack of relationship be- 
tween the receptor and collagen VI was most convincingly demon- 
strated by sequence analysis of cDNA, which failed to detect nu- 
cleotide sequences coding for Gly-X-Y repeats in one of the subunits of 
the receptor (Tamkun et al., 1986). 


IV. IMMUNOLOGY 


Polyclonal antisera specific for collagen VI have been useful tools in 
analyzing various forms of the protein and were instrumental in es- 
tablishing the correlation between intact and degraded forms of col- 
lagen VI (Section I,A). They were raised by using pepsin-solubilized 
collagen VI (von der Mark et al., 1984; Trüeb and Bornstein, 1984; 
Hessle and Engvall, 1984; Schuppan et al., 1985), chain constituents of 
this material (Jander et al., 1981, 1984; von der Mark et al., 1984), and 
intact a(VI) chains (Marton and Arnason, 1982; Carter, 1982b; Gibson 
and Cleary, 1983) as immunogens. Most of the studies demonstrated 
lack of cross-reactions with collagens I to V, 7 S collagen, laminin, and 
fibronectin by enzyme immunoassays, immunoblotting, radioim- 
munoassays, and passive hemagglutination. If such cross-reactions 
were observed (collagen V, fibronectin), cross-adsorptions indicated 
that they arose due to contamination of the immunogen and not by 
shared epitopes (Risteli et al., 1980; Marton and Arnason, 1982; Car- 
ter, 1982b; Gibson and Cleary, 1983; Jander et al., 1984; von der Mark 
et al., 1984; Triieb and Bornstein, 1984; Schuppan et al., 1985). Mono- 
specific antibodies were also prepared from these polyclonal antisera 
by immunoadsorption and were particularly useful in immunohis- 
tological studies (Jander et al., 1981; Carter, 1982b; von der Mark et 
al., 1984). 

Immunochemical studies demonstrated that collagen VI possesses 
potent sequential epitopes which are shared by the triple-helical mole- 
cule and its constituent chains as well as conformational epitopes 
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which are dependent on triple-helical folding and/or disulfide bonding 
(von der Mark et al., 1984). This made polyclonal rabbit antisera raised 
either against triple-helical material or unfolded chains both useful 
for immunoblotting of even small peptides (Heller-Harrison and Car- 
ter, 1984) and for the characterization of various forms of collagen VI 
antigens in biological materials (see below and Section V). Epitopes 
could also be located to the triple-helical and globular domains of 
pepsin-solubilized collagen VI, which are presumably of different 
nature (von der Mark et al., 1984). Diversity of epitopes was also indi- 
cated in immunostaining of both beads and filaments of cell-derived 
collagen VI microfibrils by polyclonal antibodies (Bruns et al., 1986). A 
more restricted reaction pattern was reported for some anti-chain anti- 
sera which stained only al(VDp and a3(VDp chains in immunoblotting 
(Jander et al., 1984). A similar restriction in immunoblot reaction was 
also observed in other (Trüeb and Bornstein, 1984; Crawford et al., 
1985; Gibson and Cleary, 1985) but not all studies. Chain-specific anti- 
bodies can, however, be generated by affinity chromatography on 
a(VDp chains (Trüeb and Winterhalter, 1986). The identification of 
the reacting chains as originating from al(VD, «2(VD), or «3(VD has, 
in most cases, not been confirmed with purified material and should, 
therefore, be considered tentative. 

The development of sensitive and quantitative radioimmunoassays 
for collagen VI appears to be difficult due to a high background bind- 
ing observed for pepsin-solubilized collagen VI (von der Mark et al., 
1984). The choice of high-salt buffers and careful storage conditions 
may overcome these difficulties and allow the quantitation of as little 
as 3 ng/ml of collagen VI by inhibition assays (Schuppan et al., 1985). 
This assay was useful for the detection of collagen VI antigen in 
human serum (22 + 8 ng/ml) and bile but not in urine. Most of the 
circulating collagen VI has a M, > 400,000, indicating the presence of 
oligomers in serum. 

The production of monoclonal antibodies specific for collagen VI was 
achieved by using spleen cells from mice immunized either with whole 
human placenta (Hessle and Engvall, 1984) or pepsin-solubilized col- 
lagen V from chicken gizzard (Linsenmayer et al., 1986). Since both 
materials used for immunization are presumed to contain only small 
amounts of collagen VI, the presence of antibodies indicates a high 
immunogenic potential for collagen VI compared to other collagenous 
proteins. The monoclonal antibodies specific for placenta collagen VI 
were selected for by their ability to bind a complex of a(VD chains and 
GP-250 from cell cultures. Some, but not all, antibodies also reacted 
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with pepsin-solubilized collagen VI, which indicates the recognition of 
several epitopes differing in stability against proteases. Some epitopes 
are characterized by high thermal stability but were also found to be 
nonreactive in under-hydroxylated collagen VI (Hessle and Engvall, 
1984). Whether these data indicate the crucial involvement of hydrox- 
ylated amino acids or the conformational nature of epitopes is not 
quite clear. A distinct conformation dependence of epitopes was dem- 
onstrated in inhibition assays with monoclonal antibodies against 
chick collagen VI (Linsenmayer et al., 1986). 

The various polyclonal and monoclonal antibodies raised against 
collagen VI have been used in many applied studies, such as tissue 
localization (Section V) and establishing the relationship between de- 
graded and intact collagen VI from tissues and cells using immu- 
noblotting, immunoprecipitation, and enzyme immunoassays (Heller- 
Harrison and Carter, 1984; Jander et al., 1984; Hessle and Engvall, 
1984; Trüeb and Bornstein, 1984; von der Mark et al., 1984; Aumailley 
et al., 1985; Gibson and Cleary, 1985; Engvall et al., 1986; Trüeb and 
Winterhalter, 1986). Due to their restricted specificity, monoclonal 
antibodies were also useful for mapping epitopes at the ultrastructural 
level. One of the antibodies showed a distinct binding to the filamen- 
tous elements of collagen VI microfibrils (Fig. 8) indicating its reac- 
tion with the triple-helical domain (Bruns et al., 1986). Monoclonal 
antibody binding could also be demonstrated after rotary shadowing 
(Fig. 10), indicating the reaction with a triple-helical epitope about 25 
nm away from the inner globular domain in collagen VI dimers (Lin- 
senmayer et al., 1986). 

The collagen VI-associated GP-250 component has so far not been 
comprehensively studied by immunological methods. An antiserum 
raised against lung fibroblasts showed, after affinity purification on a 
cell matrix-adsorbent immunoblot, reactions with GP-250 and several 
other polypeptides but not with a(VD chains (Carter, 1982b). Coprecip- 
itation of collagen VI and possibly GP-250, which were named MFP I 
and MFP II in other studies (Sear et al., 1978, 1981a), was observed 
with antiserum against a crude preparation of so-called microfibrillar 
protein. Whether the antiserum contains antibodies against both com- 
ponents is not known. A single monoclonal antibody with a undefined 
origin of the immunogen was also found to bind the collagen VI- 
GP-250 complex (Engvall et al., 1986) and to react with the globular 
beads of collagen VI microfibrils (Bruns et al., 1986; see Fig. 8). This 
antibody still reacted with GP-250 after complete reduction under de- 
naturing conditions but not with a(V]) chains, indicating the recogni- 
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Fic. 10. Visualization by rotary shadowing of monoclonal antibodies bound to mono- 
meric (a) and dimeric (b) chick collagen VI. In each case, the photographs show 12 
selected particles, and the position of the antibody molecule is indicated in a few in- 
stances by an arrow. Bars, 100 nm. The photographs were kindly supplied by Dr. Rich- 
ard Mayne. Reproduced with permission from Linsenmayer et al. (1986). 
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tion of sequential epitopes. The data would be compatible with the 
assumption that GP-250 and the precursor of the a3(VI) chain are 
identical. 


V. Tissur DISTRIBUTION 


A broad tissue distribution of collagen VI was originally recognized 
in biochemical studies which isolated the protein from pepsin digests 
or denaturing extracts obtained from a variety of sources. The tissues 
used included placenta (Furuto and Miller, 1980; Risteli et al., 1980; 
Jander et al., 1981; Abedin et al., 1982; Odermatt et al., 1983; von der 
Mark et al., 1984), large blood vessels (Chung et al., 1976; Gibson and 
Cleary, 1982; Jander et al., 1984; Trüeb and Bornstein, 1984), nuchal 
ligament (Gibson and Cleary, 1982, 1985; Knight et al., 1984; Jander et 
al., 1984), uterus (Abedin et al., 1982; Trüeb and Bornstein, 1984; Ayad 
et al., 1985), liver (Rojkind et al., 1979), kidney (Risteli et al., 1980), 
skin (Laurain et al., 1980), muscle (Marton and Arnason, 1982), neu- 
rofibroma (von der Mark et al., 1984; Fleischmajer et al., 1985), and 
cornea (Linsenmayer et al., 1986; Zimmermann et al., 1986). The 
amounts of collagen VI relative to that of total collagen were deter- 
mined only for a few tissues, and the values reported range from 1.5— 
3% for placenta and uterus, to less than 0.5% for the chorionic and 
amniotic membranes (Abedin et al., 1982), to 0.1% for normal liver 
(Schuppan et al., 1985). Cornea seems to be a tissue particularly rich in 
collagen VI, where it was estimated to represent one-quarter of total 
collagen (Zimmermann et al., 1986). 

The observations of an ubiquitous tissue occurrence of collagen VI 
were confirmed and extended by immunostaining at the light-micro- 
scopic level (indirect immunofluorescence or peroxidase technique) 
using affinity-purified polyclonal antibodies against pepsin-sol- 
ubilized collagen VI (von der Mark et al., 1984) or its constituent 
chains (Jander et al., 1981), monoclonal antibodies (Hessle and Eng- 
vall, 1984; T. F. Linsenmayer, R. R. Bruns, A. Mentzer, and R. Mayne, 
unpublished results, 1986), or antisera specific for a(VI) chains (Mar- 
ton and Arnason, 1982; Gibson and Cleary, 1983). These studies dem- 
onstrated mainly fibrillar staining patterns (Fig. 11) of interstitial 
septa and sinusoids of parenchymal organs (liver, lung, kidney, 
spleen), of the three layers of large arteries, of small blood vessels, and 
of fetal membranes. Intensive staining was also observed in tissues 
with abundant extracellular matrix, such as skin, tendon, and nuchal 
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ligament, in the endoneurium of nerves, in the perichondrium of car- 
tilage, and around muscle cells. Controversy still exists about positive 
(Hessle and Engvall, 1984; Ayad et al., 1984; Linsenmayer et al., 1986) 
or negative reactions (Marton and Arnason, 1982; von der Mark et al., 
1984) of muscle perimysium and hyaline cartilage. Nucleus pulposus, 
however, seems to contain collagen VI or a related protein with a 
staining pattern indistinguishable from that observed for collagen II 
(H. von der Mark, H. Wiedemann, K. Remberger, and R. Timpl, un- 
published results, 1986). A brilliant staining was also observed with 
corneal stroma but apparently did not include Descemet’s membrane 
(Schittny et al., 1985; T. F. Linsenmayer, R. R. Bruns, A. Mentzer, and 
R. Mayne, unpublished results, 1986). 

The staining patterns observed for collagen VI located the protein in 
most of the extracellular spaces which also contain collagens I and III 
and fibronectin. Differences in details of the staining patterns indi- 
cated, however, a lack of close codistribution of these proteins. Studies 
of aortic media and nuchal ligament (Gibson and Cleary, 1983; von der 
Mark et al., 1984) demonstrated a distinct separation of regions 
stained for collagen VI from elastic lamellae (see Fig. 11). The data so 
far also indicate the absence of collagen VI from basement membranes 
except for renal mesangium. A possible presence in cartilage matrix 
cannot be excluded but will depend on biochemical data for a convinc- 
ing demonstration. 

Evidence for a clear anatomical separation of collagen VI structures 
from cross-striated collagen fibrils (types I and III) and elastic fibers 
was obtained by immunoelectron microscopy (von der Mark et al., 
1984). Studies with aorta and placenta frequently showed the ferritin 
stain along microfibrils of 5-10-nm diameter in a regular fashion (Fig. 
12). Other regions which were stained included tissue sections which 
appeared unstructured as well as compartments close to smooth mus- 
cle cells. Staining patterns were often found to be oriented perpen- 
dicular to the direction of major collagen and elastin fibrils. Staining 
of microfibrils and amorphous regions, but not of basement mem- 
branes, was also reported for neurofibroma tissue (Fleischmajer et al., 
1985) and placenta (Amenta et al., 1986). The data agree with those 
obtained from cell cultures which also demonstrated the assembly of 
collagen VI into microfibrillar structures (Fig. 7) independent from 
other fibrillar elements and fibronectin (Bruns et al., 1986). A recent 
study with monoclonal antibodies demonstrated, by immunogold 
staining, regular patterns with 100-nm periodicity in the corneal 
stroma (T. F. Linsenmayer, R. R. Bruns, A. Mentzer, and R. Mayne, 
unpublished results, 1986), which is in agreement with the fibrillar 
models proposed for collagen VI (Section II,D). 
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Fia. 12. Ultrastructural localization of collagen VI microfibrils in human placenta (a) 
by ferritin immunoelectron microscopy. The control section (b) was treated with nonim- 
mune IgG and shows abundant amounts of microfibrils. Most of these microfibrils are 
stained by anti-collagen V antibodies in a periodic fashion (a) as indicated by arrows ina 
typical example. The photographs were kindly provided by Dr. Raul Fleischmajer and 
reproduced with permission from von der Mark et al. (1984). 
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A few earlier studies (Sear et al., 1978, 1981a,b) based on antibody 
precipitation considered the polypeptides which are now known as 
a(VI) chains to be subunits of microfibrils usually associated with 
developing elastin fibers (Cleary and Gibson, 1983). This possibility 
appears rather unlikely considering the unique morphology observed 
for collagen VI fibrils (Bruns, 1984). This interpretation is also not 
supported by immunostaining (see above) and some previous data indi- 
cating the noncollagenous nature of elastin microfibrils. A more re- 
cent study (Ayad et al., 1985) indicated that a(VI) chains and micro- 
fibrillar protein MFP I are different structures, but it is not clear how 
this interpretation can be related to previous data (Sear et al., 1981a). 
Several other microfibrillar structures, such as oxytalan fibers or mi- 
crofibrils stained by ruthenium red, have been identified in the extra- 
cellular matrix by morphological studies (Cleary and Gibson, 1983). 
Whether these fibrils correspond to collagen VI microfibrils (Fig. 12) 
or contain collagen VI in a different macromolecular organization re- 
mains, at present, a matter of speculation. 


VI. POSSIBLE FUNCTIONS AND INVOLVEMENT IN DISEASE 


The ubiquitous occurrence of collagen VI in extracellular matrices 
indicates an important biological role for this protein, but the under- 
standing of its function is still based on guesses rather than solid facts. 
The localization of collagen VI in tissues by immunoelectron micros- 
copy suggested a network of microfibrils which appears to be indepen- 
dent in orientation from the major fibrillar elements (von der Mark et 
al., 1984). The possible contribution of such a network to the mechan- 
ical and biological properties of connective tissue is still ill defined. 
Triieb and Bornstein (1984) suggested an adapter function, with col- 
lagen VI serving as a link for fibrous and globular proteins with cells. 
Strong interactions with collagen VI are so far indicated for GP-250 
(Section III) but are weak or absent for fibronectin and procollagens 
(Carter, 1982b). 

Another interesting possibility was indicated by the work of Carter 
(1982a), who demonstrated attachment and spreading of fibroblasts on 
substrates prepared from native collagen VI. This interaction was de- 
pendent on intact disulfide bonds and could not be blocked by anti- 
bodies against fibronectin. Similar interactions may exist in vivo or in 
cell cultures since collagen VI fibrils are often deposited in close 
vicinity to cells (Carter, 1984; von der Mark et al., 1984; Bruns et al., 
1986). Some preliminary indications that collagen VI structures may 
also be involved in platelet aggregation exist (Morton et al., 1985). A 
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further surprising feature of collagen VI is the obvious absence of 
cross-links other than disulfide bonds or unknown reducible cross- 
links. This suggests that collagen VI networks may be readily modu- 
lated in a reversible manner based, for example, on particularly labile 
intermolecular bonds reducible with 1-10 mM cysteine (Odermatt et 
al., 1983). Whether the formation of cross-banded structures by minor 
experimental treatments (Section II,D) reflects such a possibility re- 
mains to be demonstrated. 

Further clues about the function of collagen VI may be expected 
from studies of changes found under pathological conditions. The 
available data are rather scarce. Immunofluorescence demonstrated 
increased collagen VI staining in cirrhotic liver (Biempica et al., 1977), 
neurofibroma (Fleischmajer et al., 1985), and elastotic areas of sun- 
damaged skin (Chen et al., 1986). An increase of collagen VI in fibrotic 
livers was also demonstrated by radioimmunoassay while changes of 
serum levels were variable in these patients (Schuppan et al., 1985). 
This suggests increased production of collagen VI in some, but perhaps 
not all, fibrotic diseases. 

The best evidence for a pathological increase in collagen VI syn- 
thesis (4 to 6-fold) is available from a study of cultured fibroblasts of a 
patient with cutis laxa (Crawford et al., 1985), a rare, acquired or 
inherited syndrome characterized by loss of skin elasticity. It is not 
unlikely that patients with other inherited connective tissue disorders 
(i.e., Marfan syndrome) show aberrant collagen VI synthesis or even 
possess mutant genes for collagen VI. These intriguing possibilities 
remain to be studied in the near future. 
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I. INTRODUCTION 


The ultrastructure of connective tissues varies dramatically with 
regard to the size, orientation, and morphology of structural elements. 
For example, a morphologically simple tissue such as tendon appears 
to be nearly exclusively composed of a single fiber type. These fibers 
have a uniform banded appearance, are of a similar diameter, and are 
all oriented parallel to the direction of applied stress. Disregarding a 
few fine filaments which occasionally can be seen in micrographs at 
high resolution, this ultrastructure could be relatively easily ex- 
plained in terms of a single collagen molecule capable of a limited 
number of highly specific interactions. In contrast, the matrix of skin 
is highly complex, particularly at the dermal—epidermal junction. In 
skin, banded collagen fibers can be visualized which appear grossly 
similar to the fibers seen in tendon, although the variations of fiber 
diameters are far greater. Skin contains several fibrous elements in 
addition to the banded collagen fibers which are not obviously related 
to collagens based solely upon morphological criteria. However, over 
the past several years it has become increasingly clear that many of 
these fibers are unique forms of collagen. 

Since the first report of collagen heterogeneity over a decade ago, it 
has been tacitly assumed that each additional collagen type could 
eventually be assigned a specific and unique function. In at least some 
cases this probably has been fulfilled. Type VII collagen is one of the 
newly identified members of the collagen family. In this article, a 
summary will be made of the existing data which indicate that this 
unique macromolecule is one of the largest of the known collagens. It 
is capable of forming antiparallel dimers stabilized by intermolecular 
disulfide bonds through a short overlap, which can then condense into 
an unstaggered lateral array just less than two molecules in length. 
The fibrils appear not to be capable of further linear growth, but 
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rather they interact with other molecules (including type IV collagen) 
to form more extended structures. On the basis of molecular dimen- 
sions, banding patterns, tissue distribution, and ultrastructural immu- 
nolocalization, type VII collagen has been recognized as a major and 
perhaps sole component of anchoring fibrils. The anchoring fibrils are 
believed to function by securing the lamina densa, which underlies 
certain epithelia, to its subjacent stroma. 


II. THE STRUCTURE OF Type VII COLLAGEN 
A. The Triple-Helical Domain 


1. ISOLATION OF THE TYPE VII COLLAGEN TRIPLE HELIX 


The type VII collagen triple helix has been solubilized from human 
skin or amniotic membrane by limited proteolysis using pepsin. The 
digestion conditions need to be optimized for each lot of pepsin, but in 
general the following procedure has been successfully utilized (Fig. 1). 
While type VII collagen can be isolated from skin, the use of a simplier 
tissue such as amnion facilitates the procedure. The amniotic mem- 
brane is separated from the chorion within 24 hr following parturition. 
The wet weight of each preparation should be between 180-200 g, 
which is the amniotic yield from approximately 25 placentas. Pooled 
membranes are washed extensively in phosphate-buffered saline to 
remove blood products. The membranes are then washed in ice-cold 
water and homogenized on ice using a polytron (Brinkmann Instru- 
ments). The final volume should be approximately 3.5 liters. 

The homogenate is harvested by centrifugation, and resuspended in 
3 liters of ice-cold 1 M NaCl, 50 mM Tris-HCl, pH 7.5. The suspension 
is stirred at 4°C for 2-3 hr. Particulate materials from the suspension 
are harvested by centrifugation, washed with 3 liters of ice-cold water, 
and reharvested by centrifugation. This material is resuspended in 1 
liter of ice-cold 0.5 M acetic acid, and crystalline pepsin (Boehringer 
Mannheim) is added to 0.08 g per 100 g wet wight. Digestion is car- 
ried out at 4°C for 16 hr. Undigested materials are removed by cen- 
trifugation, and the clarified supernatant solution is gently stirred 
during the addition of solid NaCl to 10%. Protein precipitation is al- 
lowed to proceed for 20 min, at which time the precipitate is harvested 
by centrifugation. The precipitate is redissolved in 1 liter of 1 M NaCl, 
0.1 M Tris-HCl pH 8.1. This solution is stirred for 72 hr at 4°C to 
inactivate pepsin. 
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Fic. 1. Abbreviated flow diagram of the purification of type VII collagen from human 
amnion. 


The solubilized collagens are then partially fractionated by differen- 
tial salt precipitation. Solid NaCl is slowly added to 2.7 M. After stir- 
ring for several hours, the solution is clarified by centrifugation and 
by filtering through glass wool. Solid NaCl is then slowly added to 4 M, 
stirred for several hours or overnight, and the precipitate is harvested 
by centrifugation, redissolved in 250 ml of 0.5 M acetic acid, and stir- 
red at 4°C until redissolved. Solid NaCl is slowly added to 4%, and the 
solution is clarified by centrifugation. Solid NaCl is added to the su- 
pernatant to a final concentration of 8%, and the resulting precipitate 
is harvested by centrifugation and redissolved in 50 ml of ice-cold 2x 
DEAE buffer (4 M urea, 0.5 M NaCl, 0.2 M Tris-HCl, pH 8.3). Strongly 
anionic species are removed from the solution by passing the mixture 
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over DEAE-cellulose equilibrated with 2 M urea, 0.25 M NaCl, 0.1 M 
Tris-HCl, pH 8.3. The flow-through is acidified by the addition of 
acetic acid to 0.5 M, solid NaCl is added to a final concentration of 
14%, and the precipitate is harvested by centrifugation. The precipi- 
tate is redissolved in 50 ml of cold 0.5 M acetic acid and reprecipitated 
by the addition of 200 ml of absolute ethanol. The harvested precipi- 
tate is redissolved in 2x CMC (carboxymethylcellulose) buffer (2 M 
urea, 0.08 M lithium acetate, 0.06 M LiCl, pH 4.8), and diluted with an 
equal volume of water. 

Type VII collagen is separated from type V collagen by chromatogra- 
phy on carboxymethylcellulose (CM-cellulose). The diluted sample is 
applied to a 2.5 x 10 cm column equilibrated with 1x CMC buffer at 
ambiant temperature. Elution is achieved by a superimposed linear 
gradient from 0.03 to 0.28 M LiCl using a total gradient volume of 1 
liter. Under these conditions, native type VII collagen elutes prior to 
type V collagen, at approximately 0.12 M LiCl. Column fractions are 
analyzed by polyacrylamide gel electrophoresis, and identified frac- 
tions are pooled, desalted by dialysis against 0.1 M acetic acid, and 
lyophilized. The lyophilizate has a dry weight of approximately 20 mg, 
about half of which is contributed by type VII collagen and the re- 
mainder by an unknown component which is not stained with Coomas- 
sie Blue. 

Final purification of the native type VII collagen triple-helical do- 
main can be obtained by HPLC-reverse phase chromatography on C18 
or C4 columns equilibrated with 0.1% trifluoroacetic acid maintained 
at 4°C. Elution is performed using a superimposed linear gradient 
from 0 to 52% acetonitrile. Collected fractions are maintained on ice, 
pooled, and lyophilized. 


2. CHARACTERIZATION OF THE TRIPLE-HELICAL DOMAIN 


The product of the above isolation procedures is a high-molecular- 
weight, disulfide-bonded aggregate of identical collagen a chains. Fol- 
lowing disulfide bond reduction and denaturation, only a single elec- 
trophoretic species is observed with an apparent relative molecular 
weight of 150,000 to 200,000 (Fig. 2, lane 2), depending upon which 
collagen chain is used for calibration. A somewhat more accurate es- 
timation of 170,000 was obtained following HPLC-molecular sieve 
chromatography calibrated against known collagen cyanogen bromide 
peptides (Bentz et al., 1983). This triple-helical domain peptide was 
demonstrated to be the product of a unique chain by peptide mapping 


TYPE VII COLLAGEN 149 


Vil P1 P2 

u -(VIDn 
500- =  -(P1)n 
vee) = -(P2)3 
200- 

@ -vil 
De ur = -P2 

2-Me —+ -+ -+ 
12 34 5 6 


Fic. 2. The effects of disulfide bond reduction on the electrophoretic mobilities of type 
VII, P1, and P2. Type VII collagen was substantially converted to peptides P1 and P2 by 
pepsin proteolysis. The reaction products, VII, (P1),, and (P2), were fractionated by 
HPLC-reversed phase chromatography under nondenaturing conditions. Portions of 
type VII (lanes 1 and 2), of (P1), (lanes 3 and 4), and of (P2), (lanes 5 and 6) were 
analyzed by electrophoresis without reduction (lanes 1, 3, and 5) and following complete 
reduction (lanes 2, 4, and 6) with 2-mercaptoethanol (2-Me). Relative molecular weights 
(scale, M, x 10-3) were determined from the mobilities of disulfide-bonded and reduced 
type III collagen (not shown). The results show that type VII, Pl, and P2 are all indepen- 
dently disulfide bonded. P2 alone is covalently associated into trimeric aggregates by 
interchain disulfide bonds, whereas both type VII and P1 are covalently associated into 
higher order aggregates. From Morris et al. (1986) with permission. 


using cyanogen bromide or V8-protease fragmentation. The peptide 
also has a unique amino acid composition (Table I). Based on the con- 
tent of glycine, the composition suggests that the peptide is approx- 
imately 90% triple helical and is characterized by a relatively high 
content of hydroxylysine and a distinctive ratio of leucine to iso- 
leucine. It contains cysteine, but is otherwise unremarkable. 

The type VII collagen triple helix has always been present in small 
amounts in preparations of type V collagen. The amount present var- 
ies widely depending on the extent of proteolysis occurring during the 
solubilization. This is due to the presence of a discontinuity in the 
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TABLE I 
AMINO ACID COMPOSITION OF TYPE VII COLLAGEN” 

Amino acid Type VII P1 P2 
Asx 60 63 68 
Glx 94 94 102 
Hyp 83 105 92 
Ser 41 38 35 
Gly 303 327 338 
His 6.2 Tr’ Tre 
Arg 65 67 59 
Thr 23 18 11 
Ala 61 59 60 
Pro 82 91 81 
Tyr 6.9 3.6 2.4 
Val 22 18 21 
Met 8.9 3.9 6.9 
Cys¢ 5.0 6.8 4.7 
Ile 16 7.5 11 
Leu 48 42 49 
Hyl 41 41 41 
Phe 11 6.5 9.3 
Lys 13 8.7 8.7 


aResidues per 1000 residues. From Morris et al. (1986) with permission. 
dTr, Trace. 
cDetermined as carboxymethylcysteine. 


triple helix nearly midway along its length. This discontinuity can be 
easily demonstrated by reexposure of native isolated type VII collagen 
triple-helical domain to pepsin (100 ug per ml, 4 hr at 4°C). The type 
VII collagen triple helix can be quantitatively converted to two pepsin 
products, noted as P1 and P2 (Fig. 2, lanes 4 and 6). P1 and P2 are limit 
digestion products at this temperature (Bentz et al., 1983). Densi- 
tometric quantitation of the conversion indicates that it occurs with a 
loss of no more than 5% of the initial mass. The relative molecular 
weights of P1 and P2 are 89,000 and 73,000 respectively. 

Native Pl and P2 peptides can be separated from each other and 
from type VII collagen by HPLC-reverse phase chromatography at 
4°C as described above. However, this separation is extremely sen- 
sitive to the extent of pepsin digestion, and the digestion conditions 
must be optimized before routinely utilizing this methodology. De- 
natured P1 and P2 peptides are readily separated by HPLC-molecular 
sieve chromatography using TSK-3000. The uniqueness of P1 and P2 
was demonstrated by the observation of different banding patterns of 
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segment long spacing (SLS) crystallites prepared from native Pl and 
P2 (Burgeson et al., 1985) and by the observation of nonidentical frag- 
mentation patterns observed for P1 and P2 following digestion with 
cyanogen bromide or V8 protease (unpublished results). 


3. TYPE VII COLLAGEN SOLUBILIZED FROM TISSUE USING 
LIMITED PEPSIN DIGESTION Is A DISULFIDE-BONDED 
ANTIPARALLEL DIMER 


As indicated previously, the disulfide bond reduction and denatura- 
tion product of isolated type VII collagen is a single electrophoretic 
species. However, prior to disulfide bond reduction, it migrates more 
slowly than expected for three covalently associated 170 kDa peptides 
(Fig. 2), suggesting that the chains are associated by intermolecular as 
well as interchain disulfide bonds. Similarly, both P1 and P2 peptides 
are independently disulfide bonded. Prior to reduction, P1 peptides 
migrate more slowly than expected for three covalently associated 89 
kDa peptides, but P2 peptides are associated as trimers with no indica- 
tion of higher molecular weight aggregates. These data suggested that 
intermolecular disulfide bonds occur between the P1 regions of the 
triple-helical domain but not between regions containing P2. Using a 
combination of velocity sedimentation and rotary shadowing, we were 
able to demonstrate that both whole type VII collagen triple helix and 
P1 peptides are associated as hexamers, while P2 peptides are associ- 
ated as trimers (Morris et al., 1986). The sedimentation profiles of 
unreduced P1, P2, and the uncleaved triple helix are sufficiently 
monodispersed to suggest that no covalently stabilized aggregate 
larger than a dimer of two molecules commonly occurs. 

The type VII collagen triple helix, P1, and P2 were observed by 
transmission electron microscopy as SLS crystallites (Burgeson et al., 
1985) and as individual molecules by rotary shadowing. Whole triple- 
helical molecules (Fig. 3) and P1 peptides are seen as dimeric struc- 
tures interacting through an overlap region. The dimensions of these 
molecules and of the overlap were calculated and are shown in Table 
II. The banding patterns of SLS crystallites made from whole type VII 
triple helix and from P1 are centrosymmetric, again indicating an 
antiparallel arrangement of two molecules stabilized by intermolec- 
ular disulfide bonds within an overlap of 60 nm of the P1 region. 

The amino acid sequences of the amino-terminal residues of the 
whole triple helix, and of P1 and P2, were determined. The sequences 
obtained from the whole triple helix and from P1 were identical and 
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Fic. 3. Dimers of type VII collagen as visualized by transmission electron microscopy 
following rotary shadowing. Purified native type VII collagen was examined by electron 
microscopy. The image corresponds to a 780-nm dimer of two type VII collagen triple- 
helical domains, each approximately 425 nm in length, overlapping by a distance of 60 
nm. Bar, 100 nm. 
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TABLE II 
MOLECULAR DIMENSIONS AND OVERLAP? 
SLS Rotary shadowing 
measurement (nm) measurement (nm) 
Type I 295 (70%) 297 
Type V 302 
Type VII (individual molecule) 424 (100%) 467 
P1 231 
P2 175 
Total (P1 + P2) 406 (96%) 
Type VII (dimer) 
Calculated (end to end) 8485 
Measured 785 796 
Calculated overlap® 63 
P1 dimer 
Calculated (end to end) 4624 
Measured 405 
Calculated overlap¢ 57 
Measured overlap 59-75 


aFrom Burgeson et al (1985), with permission. 

>Represents 2 x 424. 

°Overlap is obtained by subtracting measured from calculated values. 
dRepresents 2 x 231. 


distinct from the sequences obtained from P2, indicating that the over- 
lap between type VII collagen molecules occurs at the amino-terminus 
(Morris et al., 1986). 

The helix-to-coil transition of the type VII collagen triple-helical 
domain when measured by circular dichroism is biphasic, with a mid- 
point of 41°C for the major transition accounting for approximately 
85% of the amplitude of the total transition and a midpoint of approx- 
imately 55°C for the minor transition (N. P. Morris and H. P. Bäch- 
inger, personal communication). The biphasic nature of the transition 
profile is entirely dependent upon intact disulfide bonds. These data 
suggest that the intermolecular disulfide bonds contained within the 
overlap region confer additional stability to that portion of the triple 
helix. By analogy to studies of the relative insensitivity of the “7 S” 
domain of type IV collagen to bacterial collagenase digestion, the over- 
lap region of native type VII collagen was isolated following clostridial 
collagenase digestion. These studies allowed the isolation of a di- 
sulfide-bonded hexamer with a relative molecular weight for each pep- 
tide of approximately 20,000 (Morris et al., 1986). This peptide ac- 
counts for approximately 90% of the overlap region and is termed C-1 
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Fic. 4. Cyanogen bromide peptide profiles of type VII, P1, and P2 as resolved by 
HPLC-molecular sieve chromatography. Cyanogen bromide (CNBr) peptides of type 
VII, P1, and P2 were individually prepared and analyzed under identical conditions on 
TSK-3000. The patterns indicate that P1 and P2 are distinct peptides which sum to give 
the pattern of whole type VII. In addition, peptide A in type VII contains the pepsin 
cleavage site, as this peptide disappears following reexposure of type VII to pepsin, and 
the peptides Al in P2 and A2 in P1 appear following cleavage. From Morris e al. (1986) 
with permission. 


(see Fig. 9). Thus, the intermolecular disulfide bonds between type VII 
collagen triple-helical domains are confined to a 60-nm-long segment 
at the amino-terminus of each molecule. Interchain disulfide bonds 
may also occur in this region but thus far have not been directly 
demonstrated. 

As described previously, P2 peptides are covalently associated by 
interchain disulfide bonds. In order to investigate the location of the 
interchain disulfide bonds, cyanogen bromide peptides of whole type 
VII collagen triple helix were analyzed before and following disulfide 


TYPE VII COLLAGEN 155 


bond reduction. Reduction-induced changes in electrophoretic mobility 
were confined to peptides smaller than 20 kDa (Morris et al., 1986). A 
large fragmentation-resistant peptide (M, 82,000, termed CB-A), 
which was unchanged by reduction (Fig. 4), has consistently been ob- 
served. This peptide lacks a stoichiometric relationship to the other 
peptides between digests and is therefore likely to be an uncleaved 
peptide with a partially digestion-resistant methionine residue. This 
large peptide is absent from digests of P1 or P2, indicating that the site 
of pepsin-mediated protease activity occurs within this large frag- 
ment. Following pepsin digestion, CB-A partitions between P1 and P2 
as CB-A2 and CB-A1, respectively. CB-Al accounts for approximately 
79% of the mass of P2, beginning at the amino-terminus. CB-Al and 
CB-A2 are not disulfide bonded; therefore, the interchain disulfides 
occurring in P2 must be located in the carboxy-terminal 18% of the 
peptide. Interchain disulfide bonds do not occur within the pepsin- 
sensitive triple-helical discontinuity between P1 and P2. These data 
are summarized in Fig. 9. 


B. The Structure of the Type VII Collagen Biosynthetic Product 


CELL LINE SELECTION 


Identifiable type VII collagen is not easily solubilized from either 
skin or amnion by extraction using acetic acid or nondenaturing neu- 
tral solutions. Extractions using 6 M guanidine hydrochloride allowed 
the observation by rotary shadowing of dimeric molecules with a helix 
length identical to type VII collagen, which contained large disperse 
globules at the distal ends (R. Glanville, personal communication). The 
visualization of molecules similar to type VII collagen, with large ter- 
minal globular domains and the substantial insolubility of type VII 
collagen, strongly suggested that pepsin-mediated solubilization of 
type VII collagen occurs by removal of a noncollagenous domain pre- 
sent within the tissue. In order to fully assess this possibility, we 
sought to obtain a cell culture system which would allow us to isolate 
and characterize the type VII collagen biosynthetic product. To aid this 
search, we prepared a monoclonal antibody using whole type VII col- 
lagen triple helix as antigen. We demonstrated the antibody to be 
specific for type VII collagen and, by rotary shadowing of the antigen— 
antibody complex, demonstrated that the epitope occurs at the carbox- 
yl-terminus of type VII collagen dimers (Sakai et al., 1986). Using this 
antibody, we screened a variety of primary cell cultures and estab- 
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lished cell lines by indirect immunofluorescence for their ability to 
secrete the antigen in culture. Primary cultures of human amniotic 
epithelial cells and skin basal cells demonstrated reactivity with the 
antibody as did the cell lines KB (Eagle, 1955) and WISH (Hayflick, 
1961). Because of the ease of propagating the established cell lines, 
these were chosen for further study. The KB cell line is derived from a 
human oral basal cell carcinoma, while the WISH line arises from a 
spontaneous alteration of cultured amniotic epithelial cells. By a vari- 
ety of criteria, the biosynthetic products of these two cell lines are 
extremely similar, and the cell lines were used indiscriminately for 
the following studies. Either cell line was grown to confluency in 50 
T-100 flasks using standard tissue culture medium supplemented with 
ascorbic acid. On alternate days, the cell medium was harvested. The 
biosynthetic profile over 10 days appeared unchanged. Media proteins 
were precipitated using ammonium sulfate, and type VII collagen was 
partially purified by extraction of the precipitate, followed by a com- 
bination of DEAE-chromatography and velocity sedimentation 
(Lunstrum et al., 1986). Separations were monitored by ELISA assay 
utilizing the monoclonal antibody. The partially purified type VII col- 
lagen and total unfractionated media proteins were analyzed by West- 
ern blotting using the monoclonal antibody (Lunstrum et al., 1986). In 
both cases, the antibody recognizes a single peptide that has an appar- 
ent relative molecular weight of approximately 350,000 following di- 
sulfide bond reduction (Fig. 5). Since this peptide contains both triple- 
helical and globular domains, the true molecular weight could not be 
assessed by this method. To identify the globular domain, the elec- 
trophoretic gel band recognized by the antibody was excised from poly- 
acrylamide gel patterns obtained from the purest preparations of bio- 
synthetic type VII and used to produce polyclonal antibodies in rabbits. 
The resulting antibodies produced identical immunofluorescence and 
Western blotting patterns to the monoclonal antibody and did not rec- 
ognize any species in unfractionated total cell protein other than type 
VII collagen biosynthetic product (Lunstrum et al., 1986). Partially 
purified biosynthetic type VII collagen was subjected to limited pepsin 
digestion under native conditions. The product was resolved by elec- 
trophoresis following disulfide bond reduction and analyzed by West- 
ern blotting using the polyclonal antibody. As expected, the antibody 
recognized a pepsin-resistant region identical in size to the tissue sol- 
ubilized type VII collagen triple helix (Fig. 6). In addition, several 
semistable species were observed with relative molecular weights be- 
tween 50,000 and 150,000 as calibrated using globular standards. 
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Fic. 5. Immunoidentification of type VII procollagen. (A) KB cell extract (lane 1), a 
preparation of secreted media proteins (lane 2), and type VII collagen (lane 3), pepsin 
extracted from human amniotic membrane, were separated by polyacrylamide gel elec- 
trophoresis, and proteins were visualized by Coomassie Blue staining. The media frac- 
tion was also analyzed under nonreducing conditions (lane 4). (B) Identical cell (lane 1) 
and media (lane 2) samples were transferred to nitrocellulose and immunoblotted with 
the type VII collagen-specific monoclonal antibody. A direct comparison is made to 
media proteins transferred and stained with amido black (lane 3). The data demonstrate 
that a type VII collagen biosynthetic product is substantially larger than that obtained 
from whole tissues by pepsin extraction. From Lunstrum e¢ al. (1986) with permission. 


These were subsequently demonstrated to be semistable fragments of 
the carboxy-terminal globular domain (Lunstrum et al., 1986). A sim- 
ilar preparation of biosynthetic type VII collagen was subjected to 
limit bacterial collagenase digestion. The reduction and denaturation 
product was then analyzed by Western blotting using the same poly- 
clonal antibody. The antibody recognizes a single species with an M, of 
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Fıc. 6. Immunoblot analysis of pepsin-digested type VII procollagen. KB cell media 
fractions were incubated without (lane 1) and with (lane 2) pepsin. These samples and 
type VII collagen (lane 3) were fractionated by gel electrophoresis and immunoblotted 
with a type VII procollagen-specific antiserum. The relative-molecular-weight scales 
(M, x 10-3) for globular (left) and collagen helical (right) peptides are indicated and 
were determined using globular protein standards and type I collagen CNBr peptides. 
The antibody identifies a pepsin product identical in size to type VII collagen obtained 
from whole tissue using pepsin solubilization. In addition, the antisera identifies a 
series of pepsin digestion intermediates with consistent and reproducible molecular 
weights. These results suggest that the non-triple-helical domains of type VII collagen 
are complex and may have subdomain structure. From Lunstrum e¢ al. (1986) with 
permission. 


approximately 150,000 following calibration with globular standards 
(Fig. 7). The results indicated that biosynthetic type VII collagen con- 
tains a triple-helical domain identical to that isolated from whole 
tissue by pepsin digestion and at least one globular domain containing 
three identical chains with an M, of approximately 150,000 each. 

To assess the location of this large globular region, partially purified 
biosynthetic type VII collagen was visualized by transmission electron 
microscopy following rotary shadowing (Fig. 8). The images obtained 
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Fic. 7. Identification of an unusually large carboxy-terminal non-triple-helical do- 
main in type VII procollagen. Media fractions were incubated without (lane 1) and with 
(lane 2, +2-Me; lane 3, —2-Me) clostridial collagenase. Duplicate samples were frac- 
tionated by gel electrophoresis. Immunoblot analysis was performed with both the type 
VII collagen-specific monoclonal antibody (A) and the type VII procollagen-specific poly- 
clonal antiserum (B). The relative-molecular-weight scales (M, x 10-3) were deter- 
mined using globular standards. The results indicate that both antibodies identify a 
single peptide after disulfide bond reduction with an apparent relative molecular weight 
of approximately 150,000. From Lunstrum et al. (1986) with permission. 


indicate a mixture of monomeric and dimeric molecules containing 
triple-helical regions equivalent in length and overlap distance to 
those obtained from purified, pepsin-solubilized molecules. Both the 
biosynthetic monomers and dimers demonstrate large and complicated 
arrays of globules at the carboxyl-terminus of the molecules. The com- 
plexity of these regions strongly suggests a substantial subdomain 
structure, with each carboxyl-terminal region demonstrating three 
identifiable arms, each of which appears to be composed of one or more 
globules. This observation of multiple subdomains at the carboxyl- 
terminus is consistent with the immunological identification of sever- 
al semistable pepsin fragments generated from the digestion of bio- 
synthetic type VII collagen. In addition to the large globules at the 
carboxyl-terminus (NC1, Fig. 9), a small globule is obvious in both the 
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Fıc. 8. Transmission electron microscopical visualization of type VII procollagen fol- 
lowing rotary shadowing. Partially purified type VII procollagen obtained from the 
media of cultured KB cells was examined by electron microscopy. Molecules were identi- 
fied that had triple-helix and overlap lengths identical to pepsin-solubilized type VII 
collagen. In addition, these procollagen molecules contain large and complex carboxy- 
terminal globular domains, as well as small amino-terminal globules. Bar, 100 nm. 


H2N NC2 pt — P2 ———+77 COOH 
94 kDa 76 kDa 150 kDa 
233 nm 191 nm 
eo 
CB-A1 CB-A2 C-1 
151 nm 55 nm 60 nm 
60 kDa 22 kDa 20 kDa 


Fic. 9. Diagrammatic representation of our current understanding of type VII pro- 
collagen. Disulfide bonds are indicated by small solid circles. The model depicts the 
relationships in size and relative molecular weight of the various peptide regions which 
have been characterized. The structures and relative sizes of the non-triple-helical do- 
mains are not to scale, and the actual substructures of these regions are far more 
complex than depicted. 
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monomers and dimers at the amino ends flanking the overlap region 
(NC2, Fig. 9). These amino-terminal globular domains are apparently 
not recognized by the polyclonal antibodies, and therefore have not yet 
been further characterized. 

The observation of an amino-terminal globular domain (NC2) was 
not entirely unexpected. Velocity sedimentation analyses of pepsin- 
solubilized type VII collagen triple helix have suggested that these 
domains are required for dimerization and subsequent covalent sta- 
bilization (Morris et al., 1986). Velocity sedimentation of the triple 
helix following disulfide bond reduction without denaturation of the 
triple helix demonstrates that the dimers dissociate to monomers even 
under nearly physiological conditions. These data suggest that the 
interactions between the triple helices within the overlap region are 
not sufficiently stable to resist dissociation under these conditions. It 
is likely, then, that the initial association of the overlap region is 
mediated and stabilized by the amino-terminal globules prior to di- 
sulfide bond formation. 

The observation of both biosynthetic monomers and dimers by ro- 
tary shadowing suggests that dimerization can occur in the cell culture 
media of both KB and WISH cells. At this time, it is not clear whether 
these dimers are stabilized by disulfide bonding. Velocity sedimenta- 
tion analyses occasionally suggest the presence of covalently stabilized 
dimers, but at this time these data are equivocal. The biosynthetic 
studies do suggest that the synthesis and secretion of biosynthetic type 
VII collagen is a slow process. Although the kinetics of secretion, di- 
mer formation, and disulfide-bond stabilization have not yet been 
rigorously studied, the present data suggest that substantial amounts 
of biosynthetic type VII collagen do not accumulate in cell culture 
media during the first half of the 24-hr collection period. 

Thus far, biosynthetic studies have shown no evidence of proteolytic 
processing of biosynthetic type VII collagen in cell culture, although 
these studies have not been rigorously pursued. As has been the case 
with other collagen types, the use of cell culture systems is usually not 
optimal for detection of processing, and these studies are more suitable 
to organ culture systems utilizing long term pulse—chase protocols. 
However, the electrophoretic mobility of biosynthetic type VII col- 
lagen derived from cell culture has been compared by Western blot 
analysis to that of molecules extracted from whole tissue without pep- 
sin digestion (G. Lunstrum, personal communication). These prelimi- 
nary studies suggest that the extracted molecule is slightly smaller 
than that accumulated within the cell culture media. Rotary shadow- 
ing of extracted molecules demonstrates images lacking part or all of 
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the amino-terminal globular domain, suggesting that if processing 
occurs physiologically, it may be restricted to the amino-terminal end 
of the molecules. Since we believe these globular domains to be respon- 
sible for type VII collagen dimerization, we would predict that process- 
ing would occur only following intermolecular disulfide bond forma- 
tion within the overlap region. 


II. THE FUNCTION oF Type VII COLLAGEN 


A. Tissue Distribution 


As with all collagens, the fiber, not the individual molecule, is the 
physiological subunit of connective tissue function. To ascertain the 
function of type VII collagen, the fiber form of this macromolecule was 
investigated, initially by determining the tissue distribution of type 
VII collagen (Sakai et al., 1986) and subsequently by attempting to 
identify the ultrastructural correlate of the type VII collagen fiber 
(Sakai et al., 1986; Keene et al., 1987). Finally, an attempt was made to 
identify inherited disease conditions in which accumulation of the 
macromolecule is deficient. 

The tissue distribution of type VII collagen was determined by indi- 
rect immunofluorescence using the monoclonal antibody to the car- 
boxyl-terminus of the triple-helical domain of type VII collagen as 
previously described (Sakai et al., 1986). A variety of tissues were sur- 
veyed, including skin, chorioamnion, placenta, lung, oral mucosa, 
tongue, bladder, vaginal mucosa, cartilage, and cornea (Table III). In 
general, the antibody produced crisp, brilliant fluorescence in a linear 
pattern underlying the epithelial cells of many external tissues. In 
skin, the basement membrane zone at the dermal-epidermal junction 
was brightly fluorescent, but no staining of the dermal vasculature or 
adnexal structures was observed. In chorioamnion, linear staining was 
found immediately under the amniotic epithelial cells corresponding 
to the basement membrane zone, but was not seen anywhere within 
the stroma or the chorionic pseudo basement membranes. No staining 
of placenta, muscle, normal lung, bladder, or cartilage was observed. 
The data indicate that type VII collagen is found within the basement 
membrane zone of certain epithelial cell layers. Staining could not be 
directly correlated with the complexity of the epithelial cell layer nor 
with the embryologic origin of the cells presumed to be responsible for 
its synthesis. 

As discussed earlier, the banding pattern of SLS crystallites made 
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TABLE III 
CORRELATION OF THE PRESENCE OF Type VII COLLAGEN AS DETECTED BY 
IMMUNOFLUORESCENCE WITH ANCHORING FIBRILS 
DETECTED ULTRASTRUCTURALLY® 


Immunofluorescence Anchoring fibrils 
Skin + + Brody (1960) 
Chorioamnion + + Burgeson ei al. (1986) 
Placenta = = 
Muscle = = 
Eye + + McTigue and Fine (1966) 
Oral mucosa + + Susi ei al. (1967) 
Cervix + + Laguens (1972) 
Kidney = = 
Lung _ = 
Liver = = 
Stomach u u 
Intestine = = 


Elastic cartilage = = 


«From Sakai et al. (1986) with permission. 


from pepsin-solubilized type VII collagen shows a distinctive cen- 
trosymmetric pattern. A striking similarity between this pattern and 
that observed ultrastructurally along anchoring fibrils was noted by 
Dr. Romain Bruns. Since the ultrastructural identification of anchor- 
ing fibrils correlated well with the tissue distribution of type VII col- 
lagen as demonstrated by indirect immunofluorescence, the possible 
relationship between the anchoring fibril and type VII collagen was 
further investigated. 


B. Anchoring Fibrils 


Anchoring fibrils are broad, centrosymmetrically banded structures 
approximately 800 nm in length (Palade and Farquhar, 1965; Bruns, 
1969). They occur within the subbasal lamina of numerous external 
tissues and appear to have two distinct morphologies. They are often 
seen as “horseshoe-shaped” structure with both distal ends inserted 
into the lamina densa and the banded portion of the structure looping 
into the subbasal lamina often surrounding large banded collagen 
fibrils. More frequently, they appear to have one end inserted into the 
lamina densa and extend perpendicularly to the basement membrane 
with the opposite end inserted into an electron-dense amorphous patch 
(Keene et al., 1986). These amorphous patches have been thought to be 
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tangential sections of basement membrane undulations (Palade and 
Farquhar, 1965). Anchoring fibrils are believed to function by provid- 
ing additional structural stability to the subbasal lamina of selected 
tissues by physical entrapment of dermal matrix components. The 
proposed attachment function must either be in addition to or sub- 
stitute for other attachment mechanisms, as not all epithelia overly- 
ing a connective tissue stroma contain anchoring fibrils. A comparison 
of the banding patterns of anchoring fibrils with those observed for 
type VII collagen SLS crystallites suggests that the main body of the 
anchoring fibril is composed of the type VII collagen triple-helical 
domain. The carboxy-terminal globular domain (NC1) would appear to 
insert and be encapsulated within the basement membrane or the 
amorphous electron-dense patches. If this model were correct, mono- 
clonal antibodies directed to the carboxy-terminal end of the triple- 
helical domain should recognize an epitope at the distal ends of banded 
anchoring fibrils and localize the site of the insertion of the anchoring 
fibril into the amorphous electron-dense patches within the subbasal 
lamina. This hypothesis was tested by ultrastructural immunolocali- 
zation using en bloc impregnation of the above-described monoclonal 
antibody into unfixed tissue followed by reaction with a second anti- 
body conjugated with colloidal gold (Sakai et al., 1986). As shown in 
Fig. 10A, gold deposits are found in the predicted sites. The intensity 
of gold labeling at the basal lamina is far less than expected. However, 
this would appear to be a result of the en bloc antibody penetration 
procedure, as antibodies to basal lamina components such as type IV 
collagen or laminin also have difficulty penetrating the lamina densa 
of the dermal—epidermal junction under identical conditions (Keene et 
al., 1986). The diffusional barrier to antibody penetration is different 
at the dermal-epidermal junction than at other skin basement mem- 
branes, for uniform labeling with anti-type IV collagen was easily 
obtained for vascular basement membranes present in skin even using 
the en bloc penetration method. Surface labeling of either Lowacryl- 
embedded skin directed by antibodies to either type IV collagen or to 
type VII collagen (figure 10B) show intense labeling of the lamina 
densa, but banded anchoring fibrils are somewhat more difficult to 
identify (Keene et al., 1987). These data demonstrate that anchoring 
fibrils are specifically labeled as predicted by the proposed structure of 
type VII collagen. This specific labeling, together with the strong sim- 
ilarity in banding patterns between anchoring fibrils and type VII 
collagen SLS crystallites in addition to the correlation of tissue dis- 
tribution shared by anchoring fibrils of type VII, strongly support the 
identification of type VII collagen as a major component of anchoring 
fibrils. 
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Fıc. 10. Immunoelectron microscopic localization of type VII procollagen at the der- 
mal-epidermal junction of human skin. (A) Monospecific anti-type VII procollagen anti- 
sera was infused into unfixed tissue. Following washing, colloidal gold-containing sec- 
ond antibodies were similarly applied, and unbound antibody was removed by extensive 
washing. The tissue was then fixed, stained, and sectioned for examination. By this 
technique, gold is consistently localized to electron-dense patches at the dermal termini 
of anchoring fibrils. (B) The same antisera was applied to the surface of sections cut 
from Lowacryl-embedded foreskin. By this technique, gold is deposited upon the basal 
lamina in addition to the dermal plaques. Bar, 500 nm. 


C. The Anchoring Fibril Network 


In the course of these experiments, it was consistently observed that 
intense antibody-directed gold deposition was seen over the amorphous 
electron-dense patches at the dermal termini of anchoring fibrils. Fur- 
thermore, these patches occur frequently along the basement mem- 
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brane within the subbasal lamina and were most commonly found 
within 0.5-1.5 um of the basal lamina but occasionally were seen as 
far as 2 um distant. The regularity of their appearance appeared to be 
inconsistent with their identification as tangential sections of base- 
ment membrane undulations as previously suggested. In order to test 
this concept, human skin which had been labeled en bloc with anti- 
bodies to type VII collagen, and therefore showed intense labeling of 
these regions, were serial sectioned, and the appearance and disap- 
pearance of these regions were observed (Keene et al., 1987). The elec- 
tron-dense amorphous regions appear and disappear within these 
serial sections and demonstrate no continuity with the basement mem- 
brane other than that provided by interconnecting anchoring fibrils. 
These observations led to the conclusion that these patches were inde- 
pendent of the lamina densa but retained an ultrastructural re- 
semblance to that structure. We have termed these electron-dense re- 
gions as anchoring plaques. These plaques appear throughout the 
subbasal lamina and are interconnected by anchoring fibrils. They are 
frequently seen at a distance of more than one anchoring fibril length 
from the lamina densa and nearly always appear to be the sites of 
anchoring fibril insertion. These observations led to the hypothesis 
that the most common form of anchoring fibril is that which extends 
perpendicularly from the lamina densa, inserting into the lamina den- 
sa at one end and into the anchoring plaque at the other. Anchoring 
plaques are interconnected by additional anchoring fibrils which both 
insert and terminate in plaques. Thus, we hypothesize an extended 
network of anchoring fibrils forming a large scaffold throughout the 
subbasal lamina (illustrated in Fig. 11). This extended architectural 
superstructure is capable of entrapping large numbers of stromal con- 
nective tissue elements including large, banded collagen fibrils and 
noncollagen microfibrils. This concept is particularly appealing in that 
the attachment function of the anchoring fibril has been questioned 
because only a few of the anchoring fibrils could be demonstrated to 
physically entrap dermal connective tissue components (Wasano and 
Yamamoto, 1985). The concept of an extended superstructure circum- 
vents these objections. 

Since the anchoring plaques grossly resemble the basal lamina, 
their composition was investigated using several polyclonal and mono- 
clonal antibodies (Keene et al., 1987). Several monoclonal antibodies 
were previously produced to the carboxy-terminal globular domain of 
type VII collagen. A polyclonal antisera, which recognizes the carboxy- 
terminal globular domain as well as the carboxy-terminal end of the 
triple-helical domain of type VII collagen, was also produced in rabbits 
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Fıc. 11. Diagrammatic representation of the anchoring fibril network. Our current 
hypothesis for the structure of the anchoring fibril network predicts that anchoring 
fibrils are composed of lateral aggregates of type VII procollagen aligned in a nonstag- 
gered array. Anchoring fibrils most frequently originate within the lamina densa and 
extend perpendicularly to the basement membrane, inserting into anchoring plaques 
within the subjacent matrix. Additional anchoring fibrils originate from these plaques 
and insert into other anchoring plaques farther into the matrix. The anchoring plaques 
contain both the carboxy-terminal of type VII procollagen and type IV collagen. The 
resulting structure is an extended network forming a complex scaffolding within the 
subbasal lamina. This extended network is capable of physical entrapment of large 
numbers of matrix components within this region forming a firm connection between 
the lamina densa and the stromal matrix. From Keene et al. (1987) with permission. 


(Lunstrum et al., 1986). Immunoelectron microscopic localization stud- 
ies using both the polyclonal antibody and the monoclonal antibodies 
to the carboxy-terminal globular domain produce identical results to 
those obtained using the monoclonal antibody which recognizes the 
carboxyl-terminus of the triple-helical domain. A monoclonal antibody 
to type IV collagen was previously described (Sakai et al., 1982) which 
recognizes an epitope on each arm of type IV collagen “spiders” about 
60 nm from the 7 S region (Dieringer et al., 1985). This anti-type IV 
collagen monoclonal antibody produced gold deposition on the anchor- 
ing plaques, indicating that they contain type IV collagen. To be cer- 
tain that the plaques identified by the type IV and type VII antibodies 
were identical, double-labeling experiments were performed using the 
polyclonal rabbit antisera to the carboxy-terminal domain of type VII 
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collagen identified by 20-nm colloidal gold-conjugated anti-rabbit IgG 
second antibody and the murine monoclonal anti-type IV collagen 
antibody recognized by a 5-nm gold-conjugated anti-mouse IgG second 
antibody. The results (Keene et al., 1987) indicate the presence of both 
type IV collagen and type VII collagen carboxy-terminal globular do- 
main within the same anchoring plaques. 

Because of the ultrastructural and compositional similarities of the 
anchoring plaques and the lamina densa, the origin of these structures 
could be accounted for by a variety of events. The anchoring plaques 
may arise by tensional stress in which anchoring fibrils may pull away 
pieces of lamina densa into the subbasal lamina, followed by basement 
membrane repair. The plaques could also arise from tissue injury, or 
during basement membrane repair or regeneration occurring during 
growth. Another possibility is that the anchoring plaque is a physio- 
logically significant component of the subbasal lamina and is specifi- 
cally deposited by regulated developmental events. In an attempt to 
assess these possibilities, we have ultrastructurally examined the sub- 
basal lamina regions from a variety of tissues, in skin at a variety of 
locations, in skin as a function of age, and in skin of varying species. 
These surveys, while not exhaustive, indicate that the anchoring fibril 
network as detected by the presence of anchoring plaques exists con- 
tinuously in all tissues examined. It appears to be most extensive in 
the cornea (Fig. 12), but is clearly obvious in all other tissues contain- 
ing anchoring fibrils. In skin, the presence or extent of the network 
does not vary with the site of biopsy, as skin samples taken from the 
lower thorax, leg, thigh, footpad, or lip are ultrastructurally equiv- 
alent with regard to the anchoring plaques. Dorsal skin from neonates 
and from 16-, 31-, and 70-year-old individuals are extremely similar in 
this respect, although some basal lamina reduplication was present in 
the specimen from the 70-year-old individual. No difference in the 
basic occurrence of the anchoring fibril network was observed between 
human, bovine, chick, or mouse skins. These results strongly suggest 
that the anchoring plaques arise from an organized developmental 
program and serve a specific physiological function. Their continuous 
and consistent observation suggest that they do not arise as a function 
of injury and subsequent repair processes. 

There has been considerable speculation that the anchoring fibril 
passes from the dermis, through the lamina densa, and into the lamina 
rara where it inserts into the hemidesmosomes of the basal cell surface 
of skin. Our structural analyses of type VII collagen as related to 
anchoring fibril structure are inconsistent with this concept. To the 
extent that structure can actually be predicted from images produced 
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Fic. 12. Electron microscopic immunolocalization of type VII collagen in adult human 
cornea. Adult human cornea was lightly fixed and embedded in Lowacryl resin at low 
temperature. Sections were surface stained by a monospecific anti-type VII procollagen 
antisera followed by colloidal gold-conjugated second antibody. Gold deposition is seen 
upon the lamina densa as well as over a broad region at the epithelial edge of Bowman’s 
layer. Cornea demonstrates the most extensive anchoring fibril network thus var visu- 
alized. Bar, 500 nm. 


by rotary shadowing, our observations suggest that the complex car- 
boxyl globular domain is entirely contained within the lamina densa 
and is not directly attached to the epithelial cell surface. We cannot 
rule out the possibility that the very fine filaments called anchoring 
filaments (Briggaman and Wheeler, 1975a), which span the lamina 
rara from the hemidesmosome to the lamina densa, may be contained 
within the type VII collagen carboxy terminal globular domain but 
cannot be distinguished by the rotary shadowing technique. At pre- 
sent, we consider this possibility unlikely, as no fine filaments have 
been observed nor would be required within the dermal plaques. We 
believe that it is more likely that the anchoring filament is a separate 
gene product with the specific function of linking the hemidesmosome 
with the lamina densa. This structure may have an affinity for the 
portions of anchoring fibrils contained within the lamina densa as 
anchoring fibrils are frequently seen directly opposite hemidesmo- 
somes. However, anchoring fibrils in the absence of anchoring fila- 
ments are also frequently seen along the basal lamina. 

The presence of the type VII collagen carboxy-terminal domain 
within the lamina densa does appear to influence the formation of 
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hemidesmosomes. The studies of Gipson et al. (1983) indicate that 
epithelial cell sheets denuded of hemidesmosomes will regenerate 
those hemidesmosomes when applied in vitro to an intact basal lamina 
containing anchoring fibrils. The sites of newly formed hemi- 
desmosomes correspond well to the locations of anchoring fibrils along 
the existing basal lamina. One of the first steps in this process appears 
to be the formation of anchoring filaments between the cell surface 
and the sites along the basal lamina expected to be occupied by the 
type VII collagen carboxy-terminal globular domain. The final struc- 
ture of the hemidesmosomes is then elaborated upon this anchoring 
filament network. These studies suggest that the anchoring filaments 
originate along the basal surface of epithelial cells and accumulate 
opposite anchoring fibrils, possibly by recognition of the type VII col- 
lagen carboxyl-terminus. The apparent multidomain structure of the 
terminal ends of type VII collagen molecules is consistent with the 
possibility of multiple specific recognition sites. 


THE FUNCTION OF TYPE VII COLLAGEN 


Our present knowledge of the structure of type VII collagen as it 
relates to the anchoring fibril network suggests that this collagen type 
is one of several molecules which form a specific architectural grid- ` 
work directly linking the basal surface of epithelial cells with the 
underlying dermis. The most obvious consequence of this series of 
associated structural elements is to provide a firm attachment of the 
epithelial cell to its underlying stromal matrix. It is also possible that 
this network provides a special trans-basement-membrane route for 
the transmission of information from the dermis to the epithelial cell 
or vice versa. The mechanism of this information transfer or its possi- 
ble content is completely unknown. The requirement for this anchor- 
ing complex is also unclear. Other epithelia appear firmly affixed to 
their subjacent stroma without the presence of either hemides- 
mosomes, anchoring filaments, or the anchoring fibril network (Ka- 
wanani et al., 1978). It is possible that this additional structural com- 
plex is required for tissues subjected to relatively increased frictional 
or tortional forces. 

There are additional, less well-characterized components which ap- 
pear to be specific to these basement membrane regions as well. For 
example, our use of the en bloc antibody penetration procedure has 
consistently indicated a size-dependent diffusion barrier immediately 
underlying the dermal surface of the dermal-epidermal junction of the 
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basement membrane. Other investigators (Wasano and Yamamoto, 
1985) have observed the deposition of ruthenium red along this region, 
suggesting the presence of strongly anionic dye binding sites. The 
identity of this diffusion barrier with its apparently highly polar sites 
is unknown. It is possible that this unknown basement membrane zone 
element may be a component of this anchoring complex. 


IV. PATHOLOGICAL FINDINGS RELATED TO DISRUPTIONS 
OF THE ANCHORING FIBRIL NETWORK 


If the anchoring fibril network functions to secure the basal lamina 
to the underlying matrix, then any perturbation of this network would 
be expected to weaken the adhesion of the lamina densa to the stroma. 
There are a number of inherited conditions which are characterized by 
separation of skin basal cells from the dermal stroma. These diseases 
comprise the general classification of epidermolysis bullosa. The dys- 
trophic varieties of this disorder are characterized by separations be- 
tween the lamina densa and the dermis consistent with a potential 
defect in the anchoring fibril network (Goldsmith and Briggaman, 
1983). The recessively inherited dystrophic form in its most severe 
occurrence is characterized by a marked decrease or absence of anchor- 
ing fibrils (Briggaman and Wheeler, 1975b) in addition to increased 
collagenolytic activity (Valle and Bauer, 1980). The dominantly inher- 
ited forms also show differences in anchoring fibril morphology and 
may be reduced in number as well (Anton-Lamprecht and Hashimoto, 
1976). 

Monoclonal antibodies that recognize anchoring fibrils in human 
skin have been used to detect these antigens in a number of patients 
with various types of epidermolysis bullosa. The results (Goldsmith 
and Briggaman, 1983) indicate the absence of immunofluorescence in 
severe generalized gravis type recessive dystrophic epidermolysis bull- 
osa. Other types of the disease showed the presence of the antigenic 
determinants. The data suggest a role for anchoring fibrils in this 
disease process. However, the primary cause of the dermal—epidermal 
separation is unclear. It is quite possible that structural defects in type 
VII collagen could be a primary cause of this disease in some ped- 
igrees. It is also possible that loss of the anchoring fibril network could 
be due to a more generalized loss of all junctional components, and the 
involvement of type VII collagen in this destruction is entirely second- 
ary. The substantial complexity of the anchoring fibril network pre- 
dicted by the studies of the structure of type VII collagen strongly 
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suggest that perturbations in this network could arise from multiple 
causes. Further study will be required to understand this involvement 
oftype VII collagen with this disease condition at the molecular level. 
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I. INTRODUCTION 


Type VIII collagen was first discovered during the course of biosyn- 
thetic studies on bovine aortic endothelial (BAE)! cells in vitro (Sage et 
al., 1979). Although these cells produced primarily the interstitial, 
fibrillar type III collagen, approximately 20% of the secreted protein 
that was sensitive to bacterial collagenase displayed unusual proper- 
ties for a collagen. Three different molecular-weight forms were pre- 
sent, ranging from approximately 180,000 to 100,000, and the collagen 
itself was very labile to pepsin. A survey of its biosynthetic distribu- 
tion showed that this collagen was secreted by most, but not all, endo- 
thelial cells that had been established in culture and, for this reason, 
the protein was originally termed EC (endothelial collagen) (Sage et 
al., 1984). EC is now used to denote the individual chains of type VIII 
collagen, a structurally distinct member of the collagen gene family 
(Sage et al., 1983). 

In addition to its unusual structural properties, there were several 
aspects related to the biosynthetic regulation of type VIII collagen that 
appeared intriguing: (1) rapidly proliferating/migrating cells plated at 
low density secreted the highest levels of type VIII collagen into the 


1Abbreviations: B-APN, ß-aminopropionitrile fumarate; DTT, dithiothreitol; BAE, 
bovine aortic endothelial; CM, carboxymethyl; BCE, bovine corneal endothelial; PRF, 
pepsin-resistant fragment; DM, Descemet’s membrane; ELISA, enzyme-linked immu- 
nosorbent assay. 
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culture medium, while virtually no type VIII was recovered from the 
medium of quiescent, confluent cells (Alitalo et al., 1983; Sage et al., 
1986); (2) the collagen was also a biosynthetic product of both normal 
and malignant cells of nonendothelial origin (Sage et al., 1984); and (3) 
secretion of type VIII collagen was modulated with respect to the 
growth state and morphology of endothelial cells in vitro (Sage, 
1985a). Some of these phenotypic switches are listed in Table I. Bovine 
endothelial cells from aorta and brain capillary synthesized primarily 
the interstitial type I collagen and ceased production of type VIII col- 
lagen when they adopted a “sprouting” or elongate morphology in the 
presence of serum. Interstitial collagen production was favored by cap- 
illary cells grown in serum and tumor cell-conditioned medium, while 
cells cultured in plasma-derived serum (lacking platelet components, 
including the platelet-derived growth factor) secreted exclusively 
basement membrane (type IV) and type VIII collagens. 

In this review, we focus primarily on the structural aspects of type 
VIII collagen. Most of these studies have been performed on the meta- 
bolically radiolabeled protein produced by bovine endothelial and 
human astrocytoma cells in vitro. Recent studies on bovine corneal 
Descemet’s membrane have provided evidence that type VIII collagen 
is a major component of this tissue as well (Kapoor et al., 1985, 1986; 
Labermeier and Kenney, 1983). 


TABLE I 
MODULATION OF Type VIII COLLAGEN SYNTHESIS BY 
BOVINE ENDOTHELIAL CELLS in Vitro®> 


Source of endothelium Culture conditions Morphology Collagen type 
Aorta Confluence, FCS¢ Sprouting Villi- I 
Brain capillary Prolonged exposure Elongate VH >I 
to FCS 
Adrenal capillary FCS, TCM4, gelatin Elongate I, I 
substrate 
Retinal capillary PDSe, fibronectin Cobblestone IV, VHI 
substrate 


aSage (1985a). 

Collagen types were identified in the culture medium of metabolically labeled endo- 
thelial cells by DEAE-cellulose chromatography and SDS-PAGE as described by Sage 
and Bornstein (1982). 

eFCS, Fetal calf serum. 

dTCM, Tumor cell-conditioned medium. 

ePDS, Plasma-derived serum. 
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II. Type VIII COLLAGEN FROM CELLS IN CULTURE 


A. Identification and Isolation of the Biosynthetic Product 


Type VIII collagen has several distinguishing characteristics that 
collectively provide a set of criteria by which it can be identified. In 
Fig. 1A is shown the secretory profile of a human astrocytoma cell 
line. Type VIII collagen was secreted rather rapidly (within 1 hr after 
a 1-hr pulse with [?H]proline) primarily as EC2 (M, 125,000). Both 
EC2 and a lower molecular-weight form, EC3 (M, 100,000), were de- 
graded by bacterial collagenase (Fig. 1A, “+” lanes), and their mobili- 
ty on SDS-polyacrylamide gels (SDS-PAGE) was not altered by re- 
duction (not shown; Alitalo et al., 1983). Secretion of type VIII collagen 
by BAE cells is shown in Fig. 1B. In this experiment, an initial chro- 
matographic procedure on DEAE-cellulose provided an enrichment in 
type VIII collagen, which was present as EC1 (M, 177,000) and EC2 (I 
and II, left lanes). These components were secreted in both the pres- 
ence and absence of ascorbate, a cofactor necessary for the complete 
hydroxylation of procollagens by the enzymes prolyl and lysyl hydrox- 
ylase (Fig. 1B, II). The properties of bovine type VIII collagen that are 
readily distinguishable in metabolic radiolabeling studies are, there- 
fore, (1) secretion as EC1, EC2, and EC3, (2) lack of interchain dis- 
ulfide bonds, (3) secretion in the absence of ascorbate , (4) lability to 
bacterial collagenase, and (5) failure to bind to DEAE-cellulose. 

An initial fractionation step in the purification of type VIII collagen 
includes precipitation of radiolabeled culture medium proteins with 
ammonium sulfate. A concentration of 20% (weight-to-initial volume 
ratio) favored the recovery of EC2 and EC3, while greater amounts of 
ammonium sulfate (20-50%) were generally more effective for pre- 
cipitation of EC1 (Sage et al., 1980). Partial purification of type VIII 
collagen was achieved by chromatography on DEAE-cellulose at 4°C in 
6 M urea, as shown in Fig. 2. The initial peak, which eluted before the 
inception of the gradient, contained EC1, EC2, and EC3 (inset, lane I 
as identified by SDS-PAGE). Type VIII collagen was effectively sepa- 
rated from several other proteins secreted in greater abundance by 
bovine endothelial cells, namely fibronectin and type III procollagen 
(inset, lanes II and III). 

Table II lists the cell types that have been shown to synthesize type 
VIII collagen in vitro. This collagen was isolated and characterized 
from both normal and malignant cells representing four mammalian 
genera (mouse, cow, rabbit, and human). Type VIII collagen was the 
major collagenous protein secreted by murine brain capillary endo- 
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Fic. 1. Secretion of type VIII collagen by normal and malignant cells in vitro. (A) A 
human astrocytoma cell line was pulsed with [2H]glycine in serum-free medium contain- 
ing ascorbate and B-APN for 1 hr. This solution was subsequently replaced with medium 
containing ascorbate, ß-APN, and glycine, and radiolabeled proteins were chased into the 
culture medium. At 1 hr, 6 hr, and 3 days (a, b, and c, respectively), equal aliquots were 
removed and incubated with bacterial collagenase (+) or enzyme buffer alone (-). The 
digests and controls were precipitated with 10% trichloroacetic acid and were analyzed by 
SDS-PAGE (5% gel) in the presence of DTT. Type VIII collagen, present as EC2 (M, 
125,000) and EC3 (M, 100,000), is identified. Within 1 hr after a 1-hr pulse, a collagenase- 
sensitive component, EC2, was the major product secreted by the astrocytoma cells. 
Between 24 hr (not shown) and 3 days, EC2 was not present in the culture medium, 
although small amounts of EC3 were apparent. Reproduced in part from Alitalo et al. 
(1983) with permission. (B) Bovine aortic endothelial (BAE) cells were incubated in 
serum-free medium containing ascorbate, B-APN, and |*H]proline or in medium with 
only the isotope. Radioactive proteins were chromatographed on DEAE-cellulose as 
described by Sage et al. (1980) and were analyzed by SDS-PAGE after reduction. I, (a) 
denotes the fraction that was not bound by the DEAE-cellulose; (b) shows proteins that 
were bound and subsequently eluted with 0.5 M NaCl. II, Fractions that did not bind to 
DEAE-cellulose containing proteins metabolically radiolabeled in the presence (+ Asc) or 
absence (— Asc) of ascorbate. Type VIII collagen (EC1 and EC2) is identified, as are two 
major secretory products of BAE cells: fibronectin (FN) and thrombospondin (TS). Type 
VIII collagen elutes from DEAE-cellulose under conditions which promote binding of 
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Fig. 2. Isolation of type VIII collagen from the culture medium of bovine endothelial 
cells by chromatography on DEAE-cellulose. Radiolabeled culture medium proteins, 
secreted by bovine vena cava endothelial cells, were precipitated with 20% ammonium 
sulfate and fractionated on DEAE-cellulose at 4°C (Sage et al., 1980). Arrow indicates 
inception of 0-200 mM NaCl gradient. Peak-containing fractions were analyzed by 
SDS-PAGE (6%/10% gel) under reducing conditions. Inset shows fluorescence auto- 
radiogram of unbound fraction (I) (EC1, EC2, and EC3) and of proteins eluted with 
increasing concentrations of NaCl [peak II contained fibronectin (FN) and peak III 
contained type III procollagen (III)|. From these cells, all three forms of type VIII col- 
lagen were recovered in the 20% ammonium sulfate fraction that did not bind to the 
DEAF-cellulose. Reproduced from Sage et al. (1981) by permission of the American 
Heart Association, Inc. 


thelial cells and by three cell lines derived from human tumors 
(Ewing’s sarcoma, astrocytoma, and hepatocellular carcinoma) (Sage 
et al., 1984). From these studies it was also apparent that not all 
endothelial cells synthesized type VIII collagen (for example, human 
aortic, vena cava, and umbilical vein endothelial cells, and porcine 


most native procollagens. This property, which enables type VIII collagen to be separated 
from other secreted proteins of similar M,, was used to show that the secretion of VIII 
collagen was not dependent on inclusion of ascorbate in the culture medium. Reproduced 
from Sage et al. (1982) with permission. 
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TABLE II 
CELLS WHICH SYNTHESIZE Type VIII CoLLAGEN in Vitro 


Major secreted 


Cell type or line Origin procollagen type 
Endothelial, vascular Bovine aorta III 
Bovine pulmonary artery III 
Bovine vena cava III 
Bovine mesenteric vein III 
Bovine retinal capillary IV 
Murine brain capillary VIII 
Endothelial, corneal Bovine Descemet’s membrane III 
Rabbit Descemet’s membrane IV 
EW-A2 Human Ewing’s sarcoma VIII 
U-251MG Human astrocytoma VIII 
HS 0578T Human carcinosarcoma (breast) I 
Hep 3B21-7 Human hepatocellular carcinoma VIII 
A549 Human alveolar cell carcinoma IV 
HS 0696T Human adenocarcinoma (bone) IV 
HS 0700T Human adenocarcinoma (pelvis) IV 
HS 0746T Human adenocarcinoma (muscle) I 
Fibroblast Human fetal skin I 


“Sage (1984); Sage et al. (1984). 


aortic endothelial cells), and that type VIII collagen was not restricted 
solely to the vascular endothelium (Sage et al., 1984). In addition to its 
presence as a minor collagen in human fetal fibroblasts (Table ID), both 
Benya (1980) and Sage and co-workers (1981) described the secretion 
of significant amounts of type VIII (EC) collagen by corneal endo- 
thelial cells. These results have prompted several investigators to ex- 
amine the Descemet’s membrane as a source of this unusual collagen 
(see Section III). Another conclusion that can be drawn from the data 
presented in Table II is that the synthesis of type VIII collagen is not 
contingent upon expression of particular collagen types, since it was 
present in the culture medium concomitantly with interstitial (types I 
and III) as well as with basement membrane-associated collagens 
(types IV and V) (Sage et al., 1984). 


B. Secretory Properties 


The secretion of type VIII collagen by aortic and corneal endothelial 
cells and by astrocytoma cells was found to be independent of a re- 
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quirement for prolyl hydroxylation (Sage et al., 1982; Alitalo et al., 
1983) (see also Fig. 1B). Inhibition of hydroxylation by 98% with a,«’- 
dipyridyl, a chelator of the ferrous iron cofactor required for prolyl and 
lysyl hydroxylase activity, caused a marked reduction in the secretion 
of type III procollagen by BAE cells with no apparent effect on the 
secretion of type VII collagen (Sage et al., 1982). This property is 
similar to that observed for type IV procollagen but differs markedly 
from the secretory behavior of types I, II, and III procollagen (Crouch 
and Bornstein, 1979). 

From pulse-chase experiments, a synthesis and secretion time of 75 
min was estimated for the release of EC2 into the culture medium by 
BAE and astrocytoma cells (Alitalo et al., 1983; Sage et al., 1983). This 
interval is similar to that reported for type IV procollagen and is 
considerably longer than that required for the secretion of type I pro- 
collagen (Crouch and Bornstein, 1979). As shown in Fig. 3, EC2 (M, 
125,000) was the initial form detected in the culture medium (1.5 hr, 
lane A). At later time points, EC1 (M, 177,000) and EC3 (M, 100,000) 
were apparent. The molecular weights of these chains were unaltered 
in the presence of reducing agent [compare Fig. 3a (-DTT) with Fig. 
3b (+DTT)], and quantitative, stepwise processing to stable, lower mo- 
lecular-weight forms was not observed. This apparent lack of process- 
ing also distinguishes type VIII from the interstitial collagens. Recent 
experiments have shown that the release of type VIII collagen into the 
culture medium is favored significantly in rapidly proliferating BAE 
cells (Sage et al., 1986). This result is in accordance with the data of 
Alitalo et al. (1983), who demonstrated that the amount of type VIII 
collagen present in the culture medium of astrocytoma cells was a 
function of the degree of confluence of the cells (see also Fig. 1A, 
compare 1 hr with 3 days). Type VIII collagen, therefore, appears to be 
sequestered preferentially in the cell layer in confluent, nonproliferat- 
ing cultures. This characteristic could reflect its potential role as an 
extracellular matrix macromolecule that is closely associated with the 
cell surface. 


C. Biochemical Characteristics 


Type VIII collagen was shown to be a unique collagen by 1- and 2- 
dimensional peptide mapping (Sage et al., 1980; Benya, 1980). As 
shown in Fig. 4, the peptide patterns, based on cleavage with cyanogen 
bromide (CNBr), are distinct for each collagen type and provide con- 
vincing evidence that type VIII collagen (EC2) is a novel gene product. 

One interesting property of type VIII collagen is its sensitivity to 
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Fic. 3. Secretory characteristics of type VIII collagen: pulse-chase studies in vitro. 
BAE cells were pulsed with [SH]proline and [3Hlglycine for 1 hr in serum-free medium 
supplemented with ascorbate and B-APN. After removal of pulse medium, cells were 
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pepsin (Sage et al., 1980). Within 5 min following incubation of EC2 
with the enzyme, there was complete degradation of the molecule to 
fragments smaller than M, 50,000--60,000 (Fig. 5A) (Sage et al., 1980; 
Benya, 1980). From the pepsin digest, a peptide of M, 50,000 was 
purified to homogeneity by molecular sieve chromatography. This pep- 
sin-resistant fragment contained nearly equal amounts of hy- 
droxyproline and proline, with 6.6% of hydroxyproline present as the 
3-isomer (Sage et al., 1980). 

Further studies on the triple-helical structure of type VIII collagen 
were performed on the native collagen by exposure of the culture me- 
dium and cell layer proteins directly to pepsin without prior precipita- 
tion or chromatographic procedures. The results have been summa- 
rized in Fig. 5B, which depicts elution profiles, from CM-cellulose 
chromatography, of BAE culture medium (top) and cell layer proteins 
(bottom) after pepsin digestion. Analysis of the peak-containing frac- 
tions by SDS-PAGE revealed type III collagen as well as the M, 
50,000 pepsin-resistant fragment (50K fragment) oftype VIII collagen 
(inset, 50K). The results indicate that type VIII collagen was present 
both in the culture medium and in the cell layer (cells and extra- 
cellular matrix) of BAE cells (Sage et al., 1983). The 50K fragment was 
also produced from the native, secreted molecule and, therefore, did 
not appear to be an artifact resulting from partial denaturation. 

Additional studies were performed to assess the extent to which type 
VIII collagen was triple helical. BAE cells produced the collagen (EC1 
and EC2) over a temperature range of 24-37°C. A gelatin-binding 
domain of fibronectin, which specifically interacts with denatured col- 
lagen chains but will not bind those in native conformation, did not 
associate with type VIII collagen as isolated from DEAE-cellulose 


chased in the same medium containing 10 mM proline in place of the isotope. Each time 
point represents one dish of cells. Culture medium proteins were precipitated by am- 
monium sulfate and were chromatographed on DEAE-cellulose. The unbound (A) and 
bound (B) fractions were analyzed by SDS-PAGE (6%/10% gel) in both the presence (b, 
+DTT) and absence (a, — DTT) of DTT. Type VIH collagen (EC1, EC2, and EC3) has been 
identified on the fluorescence autoradiogram, as has fibronectin (FN), thrombospondin 
(TS), and type III procollagen. Type I procollagen and its processed intermediate forms 
(pC-, pN-) are shown on the right side of the reduced gel, and a standard of EC1 and EC2 
from BAE cell culture medium after 18 hr continuous labeling is shown on the right side 
of the unreduced gel. Within 1.5 hr after a 1 hr pulse, EC2 was secreted into the culture 
medium (lane A). EC1, EC2, and EC3 were visible by 6 hr (lane A) and accumulated in 
the medium for 23 hr. None of these forms of type VII collagen was disulfide bonded 
(compare + DTT with —DTT, lanes A). EC2 migrated between pCa1(D (M, 135,000) and 
pNal() (M, 120,000). Reproduced from Sage et al. (1983) with permission. 
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Fic. 4. Comparison of collagen types by CNBr-derived peptide maps. Bovine types I, 
III, IV, V, and VIII (EC2) collagen were cleaved with CNBr and the products resolved by 
SDS-PAGE (12.5% gel) in the presence of DTT. Some of the major collagen CNBr (CB) 
peptides have been identified. EC2 (VIII) has a unique distribution of CB peptides and 
appeared structurally distinct from the other collagen types. Reproduced with permis- 
sion from Sage © (1982), The Williams & Wilkins Co., Baltimore, Maryland. 
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chromatography. Prior denaturation at 37°C for 10 min, followed by 
affinity chromatography at 4°C, resulted in binding of EC1 and EC2 to 
the fibronectin fragment. It therefore appeared that this association 
occurred only when the binding sites of type VIII collagen were ex- 
posed after a conformational change was induced by heat. Deliberate 
denaturation also altered the susceptibility of type VIII collagen to 
mast cell protease and to vertebrate collagenase (Sage et al., 1983). 

The presence of non-triple-helical globular domains, similar to the 
N- and C-terminal propeptides characterized from the interstitial col- 
lagens, was not observed in type VIII collagen. As shown in Fig. 6, 
incubation of EC1 and EC2 with bacterial collagenase for only 5 min 
did not release any protease-resistant peptides of M, <10,000 (a max- 
imum approximation, based on the limit of detection on a 12.5% poly- 
acrylamide gel), although the EC bands were completely digested (lane 
2). Incubation of the native protein with mast cell protease, an enzyme 
that, under appropriate conditions, specifically releases propeptides 
from native interstitial procollagens, produced no perceptible change 
in the molecular weight of type VIII collagen (lane 3). However, three 
major peptides were produced after denaturation of type VIII collagen 
(lane 4). Since mast cell protease cleaves within the triple-helical se- 
quences of denatured procollagens, it is likely that the peptides (M,s 
70,000, 48,000, and 20,000) shown in lane 4 also contain Gly-X-Y se- 
quence (Sage et al., 1983). A similar spectrum of peptides was produced 
by cleavage of type VIII collagen with vertebrate interstitial col- 
lagenase (lanes 7-9). Based on the specificity of this enzyme, the 
cleavage points within the native type VIII molecule are most probably 
within Gly-X-Y domains that exhibit a somewhat relaxed triple-heli- 
cal structure. Unlike types IV and V collagen, type VIII collagen is a 
substrate for interstitial collagenase (Sage, 1982; Sage et al., 1983). 

A summary of some biochemical properties of type VIII collagen is 
presented in Table III. Experimental data have indicated the following 
structural features for this unusual protein: 


1. it is secreted as three molecular-weight forms (EC1, EC2, and 
EC3) which are not interchain disulfide bonded; 

2. itis a glycoprotein containing significant levels of hydroxyproline; 

3. it does not bind to heparin, fibronectin, or several lectins, although 
after denaturation it displays affinity for the gelatin-binding domain of 
fibronectin; 

4. it is labile to several neutral proteinases; 

5. its secretion is characterized by a minimum secretion time of 75 
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Fic. 5. Degradation of type VIII collagen by pepsin. (A) EC2 was isolated by DEAE- 
cellulose chromatography under conditions that maintain native conformation (Sage et 
al., 1983). A time course of incubation with pepsin is shown (1:100 enzyme-to-substrate 
weight ratio, 4°C, pH 2.8). EC2 and degradation products of M, 95,000 (95K) are indi- 
cated. After 5 min, EC2 was degraded to fragments of M, = 50,000 under conditions in 
which stable «-chains were produced from interstitial procollagens. Reprinted in part 
with permission from Sage, H. Pritzl, P., and Bornstein, P. (1980). Biochemistry 19, 
5747-5755. Copyright (1980) American Chemical Society. (B) Isolation of type VIII 
collagen M, 50,000 pepsin-resistant fragment from endothelial cell layers and culture 
medium. Radiolabeled BAE cell culture medium proteins were dialyzed directly against 
0.5 M acetic acid, digested with pepsin for 6 hr at 4°C, and dialyzed against a denaturing 
buffer in preparation for chromatography on CM-cellulose (Sage et al., 1983). Inset 
shows SDS-PAGE analysis (6%/10% gel) of peaks as numbered in the elution diagram. 
Peak I contained type III collagen [«1(IID)] and peak II contained an M, 50,000 fragment 
(50K) of type VIII collagen. Radiolabeled BAE cell layers (cells and extracellular ma- 
trix) were treated with pepsin and chromatographed as described above. Inset is a 
fluorescence autoradiogram of the major peak, which contained both type III collagen 
{«1(III)] and the 50K fragment of type VIII collagen. The discrepancy in gradient elution 
position between the medium and cell layer 50K fragment has not been explained. 
Reproduced from Kapoor et al. (1985) with permission. 
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Fic. 6. Susceptibility of type VIII collagen to collagenases and mast cell protease. 
Native, radiolabeled type VIII collagen (lane 1, control) was incubated at 37°C for 5 min 
with bacterial collagenase (lane 2) or mast cell protease (lane 3). Type VIII, after de- 
naturation at 45°C for 10 min (lane 5, control), was incubated with mast cell protease for 
5 min at 37°C (lane 4). Another sample of native type VIII collagen containing EC1 and 
EC3 (lane 6, control) was incubated at 24°C with human skin (interstitial) collagenase 
for 12 hr (lane 7), 30 min (lane 8), or 5 min (lane 9). Reactions were terminated by 
addition of SDS-PAGE buffer containing DTT; products were resolved on 12.5% gels 
and visualized by fluorescence autoradiography (Sage et al., 1983, 1984). Bacterial col- 
lagenase digested type VIII collagen completely; globular, noncollagenous propeptides of 
M, > 10,000 were not apparent (lane 2). Similarly, mast cell protease did not release 
putative propeptides from native type VIII collagen (lane 3). After denaturation, howev- 
er, the protein was cleaved into three major collagenous peptides of approximate M, 
65,000, 48,000, and 20,000 (lane 4). Interstitial collagenase initially produced a M, 
100,000 peptide (similar in size to EC3) and a M, 70,000 peptide (lane 9). With increas- 
ing incubation times, other fragments were apparent (approximate M, 50,000 and 
20,000) (lanes 7 and 8). Reproduced in part with permission from Sage et al. © (1984), 
The Williams & Wilkins Co., Baltimore, Maryland. 
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TABLE III 


STRUCTURAL PROPERTIES OF TYPE VIII COLLAGEN” 


Molecular size® 


Disulfide bonding 
Carbohydrate 
Hydroxyproline:proline? 


Degradation by proteases 
Pepsin 
Vertebrate interstitial collagenase 
Bacterial collagenase 
Mast cell protease 


Neutral proteinases (elastase, 
chymotrypsin, trypsin) 

Thrombin 

Schistosomal protease? 

Affinity properties 
Lectins, heparin, fibronectin 
Fibronectin-GBD/ 

Secretion characteristics 
Ascorbate requirement 
Minimum synthesis/secretion time 
Temperature range 
Location 


aSage et al. (1983, 1984). 


EC1, 180,000 

EC2, 125,000 

EC3, 100,000 
EC-PRF°, 50,000 
Intrachain 

Mannose, glucosamine 
EC1, 0.5:1 

EC2, 1:1 

EC-PRF, 1.1:1 


+ (M, 50,000 PRF) 

+ (5-7 fragments, M, 90,000-5,000) 

+ (no peptides of M, > 10,000) 

— (24°C; no “propeptides” M, > 
10,000) 

+ (37°C; 3—5 major fragments) 

+ (28°C, extensive) 


+ (28°C, 5-7 major fragments) 
+ 


+ (after denaturation) 


75 ming 

24-37°C& 

Medium (subconfluent, growing cells) 
Cell layer (confluent, quiescent cells) 


Approximate M, by SDS-PAGE, based on procollagen molecular weight standards. 


¢Pepsin-resistant fragment. 


d3-Hydroxyproline comprised between 7 and 10% of the total hydroxyproline content. 


eMcKerrow et al. (1985). 
fGelatin-binding domain. 
BAE cells. 


min, without the necessity for prolyl hydroxylation, over a temperature 


range of 24-37°C; 


6. it is digested by pepsin to fragment(s) of M, 50,000 that have a 
hydroxyproline to proline ratio of approximately 1:1; 
7. limited proteolysis occurs in the presence of vertebrate interstitial 


collagenase; and 


8. globular propeptide domains of M, 10,000 are not evident after 
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digestion of native type VIII collagen by bacterial collagenase or mast 
cell protease. 


A structural model that incorporates these experimental findings is 
presented in Section IV. 


Ill. Type VIII COLLAGEN FROM TISSUE: 
DESCEMET’S MEMBRANE 


Descemet’s membrane (DM) is the basement membrane which sepa- 
rates corneal endothelial cells from the corneal stroma. Ultrastruc- 
turally, it appears as a hexagonal, netlike matrix that is considerably 
thicker than most basement membranes. This density is due to the 
presence of many lamellar units that are joined by short, cross-linking 
‘bridges’ that are 110 nm apart and perpendicular to the lamellae 
(Murphy et al., 1984). During human postnatal development, a non- 
lamellar, nonstriated component is added posteriorly to the stacks of 
striated lamellar units (Murphy et al., 1984). It is generally regarded 
that the corneal endothelial cells secrete a major portion of the DM. 
Sawada and co-workers (1984) performed an ultrastructural analysis 
of the extracellular matrix produced by BCE cells in vitro. In the 
presence of the cross-link inhibitor B-APN, the cells deposited a hex- 
agonal lattice of 150-nm periodicity that was composed of collagenase- 
sensitive, dumbbell-shaped structures 160 nm in length. Since both 
bovine and rabbit corneal endothelial cells produced type VIII collagen 
in culture (Sage et al., 1981; Benya, 1980), several laboratories have 
examined the DM, an unusually dense basement membrane, as a 
source of this novel collagen. Although Sawada et al. (1984) claimed 
the dumbbell-shaped structures were composed of type IV collagen, 
recent studies have shown that type VIII, and not type IV, collagen is 
the major component of bovine DM (Labermeier and Kenney, 1983; 
Kapoor et al., 1985). 


A. Isolation 


Labermeier et al. (1983) first described the release of collagenous 
polypeptides of M, 50,000 from bovine DM after limited pepsin treat- 
ment. Although its possible relationship to type IV collagen was not 
clarified in this study, the authors proposed that DM was composed of 
a unique basement membrane collagen with a basic subunit of M, 
50,000. Further studies indicated that the major pepsin-resistant col- 
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lagenous moiety in the bovine DM and in the thickened DM of corneas 
from patients with Fuchs’ endothelial dystrophy was a fragment of a 
collagen, termed EC, of M, 50,000 (Labermeier and Kenney, 1983; 
Kenney et al., 1984). In recent studies, Kapoor et al. (1985) have pu- 
rified pepsin-resistant fragments of M, 50,000 from calf and adult 
bovine DM. In many respects, these polypeptides were structurally 
similar to the 50K fragment of type VIII collagen (see Section III,B). 


B. Biochemical Characteristics 


The polypeptides of M, 50,000 that were solubilized by brief pepsin 
treatment of DM did not contain interchain disulfide bonds and were 
degraded by bacterial collagenase (Kapoor et al., 1985). Peptide map- 
ping (1- and 2-dimensional) of the 50K polypeptides showed that they 
were distinct from several other collagen types, including pepsin-re- 
sistant fragments of type IV collagen (M,s 140,000, 70,000, and 
50,000). In addition, antibodies raised against the 50K polypeptides 
from DM did not react with collagen types I, III, IV, V, and VI by 
ELISA and Western immunoblotting; conversely, antibodies specific 
for these other collagens did not cross react with the 50 K polypeptides. 
The unique nature of these pepsin-resistant polypeptides from DM 
was confirmed by amino acid analysis of 50K-A and 50K-B, two chains 
that were purified by high-performance liquid chromatography. In 
addition to differences between each other, 50K-A and 50K-B were 
distinct from the other collagen types that have been shown to be 
present in cornea (Kapoor et al., 1985). The identity of these polypep- 
tides as derivatives of type VIII collagen was confirmed by immu- 
noblotting (1) BCE cell culture medium type VIII collagen, (2) the 50K 
pepsin-resistant fragment from BCE culture medium, and (3) the 50K 
fragment from the BCE cell layer, with anti-50K DM collagen anti- 
serum. Positive reactions obtained from all three samples thus con- 
firmed that the M, 50,000 polypeptides purified from bovine DM and 
BCE cell type VIII collagen were structurally related (Kapoor et al., 
1986). 

[125]]-Peptide maps (2-dimensional) of bovine DM collagenous poly- 
peptides (M, 50,000) were also published by Labermeier and Kenney 
(1983) and provided the basis for their preliminary identification as 
EC, or type VIII collagen (Sage et al., 1980). However, the amino acid 
composition of the M, 50,000 material isolated by Labermeier and 
Kenney (1983) was significantly different from that published by Ka- 
poor et al. (1985). Although there are obvious structural similarities 
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among all of the M, 50,000 collagenous components isolated from 
bovine DM, it is possible that several distinct chains are present in this 
tissue, all of which are related to type VIII collagen. 


IV. STRUCTURE oF Typer VIII COLLAGEN 


A. Cassette Model 


Based largely on data obtained from protease digestion studies, Sage 
et al. (1983) proposed a model for the structure of type VIII collagen. 
Three triple-helical domains of M, 50,000, which are resistant to pep- 
sin, nondisulfide bonded, and composed of approximately equal 
amounts of hydroxyproline and proline, are joined by noncollagenous 
segments of M, <10,000 that are resistant to bacterial collagenase. 
Large, globular N- and/or C-terminal propeptides are absent. The “par- 
ent” chain, EC1 of M, 175,000-180,000, would therefore contain three 
cassettes of M, 50,000, two interconnecting noncollagenous sequences, 
and one or two N- or C-terminal short segments (maximum M, of 
10,000). EC2 and EC3 would have two triple-helical domains (M, 
50,000, with EC3 lacking any terminal extensions. Homology among 
EC1, EC2, and EC3 was indicated by CNBr and 2-dimensional peptide 
mapping studies (Sage et al., 1980); however, processing of EC1 to 
smaller forms has not been demonstrated in vitro. That the 50K cas- 
settes themselves might be different is suggested by peptide mapping, 
amino acid composition, and N-terminal sequence analyses of 50K-A 
and 50K-B from DM (Kapoor et al., 1985, 1986). Conservation of the 
cleavage site for vertebrate collagenase in each cassette would result in 
multiples of fragments, as shown in Fig. 6 and by Sage (1982) and Sage 
et al. (1983). 

A similar model was proposed by Benya (1980) for the largest triple- 
helical form of EC (type VIII collagen) of approximate M, 540,000. 
Cleavage of native (EC1), by pepsin would produce an M, 180,000 
fragment [(60K);], while endogenous proteolytic activity could explain 
the presence of other fragments, including (EC2), of M, 360,000. After 
denaturation, three major peptides of M, 180,000, 120,000, and 60,000 
were recovered. The essential feature of both models is the existence of 
interruptions within the collagen triple helix and the presence of dis- 
crete collagenous domains of M, 50,000--60,000. Since the designa- 
tions EC1, EC2, and EC3 refer only to size, it is not known whether 
there is more than one unique chain that participates in the assembly 
of type VIII collagen. The existence of oligomeric forms of type VIII 
collagen has not been described. 
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Recently Benya and Padilla (1986) have suggested that bovine cor- 
neal endothelial (BCE) cells synthesize type VIII collagen in the form 
of a relatively small triple helix with a chain M, of 61,000. These 
authors have found that a large fraction of the higher molecular- 
weight type VIII collagen chains in the culture medium of postconflu- 
ent BCE cells (similar in size to EC1 and EC2) can be dissociated with 
a combination of acid pH and SDS to chains of M, 61,000. These higher 
molecular-weight forms, therefore, appear to be stabilized by strong, 
noncovalent, acid-labile interactions. A fraction of the higher mo- 
lecular-weight type VIII collagen produced by BCE cells is, however, 
resistant to such dissociation. It is possible that the lower molecular- 
weight form of type VIII collagen studied by Benya and Padilla (1986) 
represents an interesting processed form of the protein, but a complete 
resolution of these new findings with the models summarized above is 
not possible at this time. 


B. Triple-Helical Domains of Approximate M, 50,000 


Studies on the nonfibrillar collagen types have revealed triple-heli- 
cal domains of approximately 50,000 in molecular weight. Type X 
collagen, which has a chain M, of 59,000, was converted by pepsin to a 
triple-helical molecule of three M, 45,000 chains (Schmid and Conrad, 
1982; Gibson et al., 1982). In contrast, type VI collagen has very large 
globular domains which account for approximately 50% of the total 
molecular mass of the protein (Triieb and Bornstein, 1984; von der 
Mark et al., 1984). Cleavage of the precursor form of type VI collagen 
with pepsin, followed by reduction and alkylation, produced three a 
chains of M, 40,000--55,000 (Jander et al., 1981), although Furthmayr 
and co-workers (Odermatt et al., 1983) have estimated from electron 
microscopic measurements that the maximum size of the triple-helical 
region was 35,000—40,000 Da per chain. 

Early studies on the pepsin-derived fragments of type IV collagen 
demonstrated that at least one polypeptide of M, 50,000 was derived 
from each of the proal and pro«a2 chains of this protein (Kresina and 
Miller, 1979; Dixit, 1978). This basement membrane collagen contains 
several interruptions within the triple helix, although periodicity with 
respect to an M, 50,000 Gly-X-Y sequence has not been found. Re- 
cently, Dixit and co-workers (Dixit et al., 1985) have isolated a poly- 
peptide of M, 55,000 from guanidinium chloride extracts of bovine 
lens capsule. This chain might correspond to a3(IV) or could represent 
yet another collagen type. 

It therefore appears that the existence of domains of M, 50,000 that 


192 HELENE SAGE AND PAUL BORNSTEIN 


are capable of triple-helix formation is not an exclusive property of 
type VIII collagen. Such domains can occur singly (as in types VI and X 
collagen), or perhaps in multiples (types IV and VIII collagen?), and 
are typically flanked by noncollagenous “linkers.” Neither immu- 
nologic nor protein-sequencing studies has revealed homology among 
these triple-helical domains, other than the conservation of the triplet 
repeat that characterizes all members of the collagen gene family. 


C. Inclusion in the Nonfibrillar Collagen Gene Family? 


The structure of type IX collagen has recently been elucidated (van 
der Rest et al., 1985). It consists of three collagenous domains (of 137, 
339, and 115 residues per chain, respectively) and four noncollagenous 
domains, in each of three unique a chains, that associate by disulfide 
bonds to form the type IX collagen molecule. Based on the exon struc- 
ture of the al(IX) and a2(IX) collagen genes, Olsen and coworkers 
have proposed a distinct class of collagen genes (B) that is separate 
from that of the fibrillar collagens [types I, II, and III (class A)] 
(Lozano et al., 1985). The class A genes are characterized by exon sizes 
related to a basic 54-base pair coding unit, while the size of exons 
comprising the class B genes is more irregular. Type IV collagen, from 
which three exons of 64, 123, and 182 base pairs have recently been 
characterized, may also belong to the class B gene family (Kurkinen et 
al., 1985). 

Sage et al. (1984) proposed a novel class of collagenous proteins 
which were characterized by imperfect triple helices and a mode of 
secretion and assembly that was highly different from that of the 
fibrillar collagens. Several new collagens (or their genes) containing 
very short triple-helical domains have been described in Drosophila, C. 
elegans, and fetal calf ligament fibroblasts [see Sage (1985b) and ref- 
erences therein]. It is possible that these proteins with shortened or 
interrupted triple-helical sequences would interact preferentially with 
other macromolecules of the extracellular matrix. Selective pressure 
for the rapid folding, secretion, and assembly of perfect triple-helical 
molecules might then have occurred in only one class of collagen genes 
(class A), specifically for the fibrillar collagens that contribute in large 
part to the interstitium. 


V. SUMMARY AND CONCLUSIONS 


Based on its distribution in vitro, we predict that type VIII collagen 
(1) is a minor component of most mesenchymal tissues including the 
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vascular tree, (2) is a major component of corneal Descemet’s mem- 
brane, and (3) forms part ofthe extracellular matrix and may be close- 
ly associated with the cell surface. Its synthesis by differentiated endo- 
thelial cells and sequestration to confluent, quiescent cell layers 
suggest that this collagen could facilitate stabilization of the charac- 
teristic endothelial phenotype. Cessation of type VII collagen syn- 
thesis might then be associated with cellular injury and/or loss of the 
differentiated phenotype (as occurs during angiogenesis). 

Type VIII collagen belongs to a unique subclass of the collagen gene 
family that is distinguished, in part, by an interrupted triple helix and 
a mode of secretion that differs from those of the fibrillar interstitial 
collagen types. The presence of several triple-helical domains (of M, 
50,000) and apparent lack of large, globular propeptide sequences ac- 
count for several of the unusual features of type VIII collagen. The 
selective lability shared by types IV, V, and VIII collagen to proteases 
associated with injury, such as thrombin and a schistosomal serine 
proteinase, suggests a role for cell surface and/or basement membrane 
collagens in the maintenance of cellular integrity and a differentiated 
phenotype. 
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I. INTRODUCTION 


In addition to type II collagen, several other collagenous molecules 
are present in cartilages and can be solubilized after extensive diges- 
tion with pepsin [reviewed by Miller (1976) and Mayne and von der 
Mark (1983)]. Burgeson and Hollister (1979) first discovered that the 
pepsin digest contains small amounts of a native collagen which is 
assembled from three additional « chains of sizes similar to the «l(ID 
chain. This molecule (or molecules) is now known as type XI or 1a2a3« 
collagen (see Eyre and Wu, this volume). In the early 1980s, several 
reports were published describing disulfide-bonded and highly soluble 
collagenous fragments that were also solubilized by pepsin digestion 
and whose helical domains were much smaller than the interstitial 
collagens (Shimokomaki et al., 1980, 1981; Reese and Mayne, 1981; 
Ayad et al., 1981, 1982; von der Mark et al., 1982; Ricard-Blum et al., 
1982). Except for type X collagen, which is synthesized only by hyper- 
trophic chondrocytes (see Schmid and Linsenmayer, this volume), it is 
now clear that these fragments originate from a single parent mole- 
cule, type IX collagen. 

In this chapter, we will initially present a model for the structure of 
the intact type IX collagen molecule. This model was derived from 
both biochemical (Ninomiya and Olsen, 1984; van der Rest et al., 1985; 
Mayne et al., 1985) and rotary shadowing (Irwin et al., 1985) observa- 
tions. Subsequently, we will present the detailed evidence which sup- 
ports this model together with some speculation as to the possible 
function of this collagen. 

195 


STRUCTURE AND FUNCTION OF Copyright © 1987 by Academic Press, Inc. 
COLLAGEN TYPES All rights of reproduction in any form reserved. 


196 MICHEL VAN DER REST AND RICHARD MAYNE 
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Fic. 1. Extended model for the structure of type IX collagen. The molecule consists of 
three collagenous, triple-helical domains (COL1-COL3; open bars) and four non- 
collagenous domains (NC1-NC4; solid lines). The model was initially derived from 
nucleotide sequencing of a cDNA clone that did not extend completely to the N-terminus 
(see Ninomiya and Olsen, 1984). Subsequent studies show that the NC4 domain of the 
al(IX) chain is much larger than the NC4 domains of the a2(IX) or a3(IX) chains 
(Konomi et al., 1987; Huber et al., 1986). The N-terminus is to the left, the C-terminus to 
the right. 


In Fig. 1, the structure of type IX collagen is drawn in an extended 
form and shown to consist of three collagenous domains (COL1—COL3) 
and four noncollagenous domains (NC1—NC4). The numbering of 
these domains is from the carboxyl-terminus. This occurs as the model 
was initially derived from the sequencing of cDNA clones and the 
conversion of the nucleotide sequence of these clones to an amino acid 
sequence (Ninomiya and Olsen, 1984; van der Rest et al., 1985; 
Ninomiya et al., 1985). Such clones always begin at the 3’ end of the 
nucleotide sequence or, at the protein level, from the carboxyl-termi- 
nus. Further results, based on biochemical analyses of the pepsin- 
resistant fragments, show that the molecule consists of three genet- 
ically distinct chains, designated o1(IX), «2(IX) and «3(IX), and that 
these three chains are present in equimolar amounts in a single mole- 
cule (Mayne et al., 1985; van der Rest et al., 1985; Ninomiya et al., 
1985). Evidence in support of this model also comes from rotary shad- 
owing observations of the intact type IX collagen molecule as secreted 
by a suspension culture of embryonic chick chondrocytes (Fig. 2; also 
see Irwin et al., 1985). The NC4 domain is visualized as a small knob at 
one end of the molecule. In addition, the NC3 domain is highly flexible 
and, in many of the molecules, is viewed as a kink of no fixed angle. 
The NC1 domain cannot be visualized by rotary shadowing, and a kink 
is not easily recognized at the NC2 domain. However, the overall 
length of the molecule corresponds closely to the predicted lengths of 
the three collagenous domains (Mayne et al., 1986). 


Il. Tue Prepsin-Resistant FRAGMENTS 


A. Mammalian Species 


The first fragment of type IX collagen was isolated from pig hyaline 
cartilage after pepsin digestion (Shimokomaki et al., 1980). The mole- 
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Fıc. 2. Gallery of selected type IX collagen molecules visualized after rotary shadow- 
ing of the collagenous molecules secreted by a suspension culture of embryonic chick 
chondrocytes. Note that the NC4 domain is visualized as a small knob. In addition, a 
prominent kink is present in most of the molecules at the NC3 domain. Bar, 100 nm. 


cule was of M, 110,000 before reduction of interchain disulfide bridges 
and yielded a single band of M, 33,000 after reduction. It was called 
type M collagen. The fragment was more soluble than other previously 
reported cartilage collagens and was not precipitated from a pepsin 
extract until 2.0 M NaCl, 0.6 M HAc were used. Later, additional 
smaller molecular-weight peptides were described by the same group 
and called CF1 and CF2 (Shimokomaki et al., 1981). 

The pepsin-resistant fragments of bovine type IX collagen were in- 
vestigated independently by Ayad et al. (1981, 1982), using nasal car- 
tilage, and by Ricard-Blum et al. (1982, 1985), using epiphyseal car- 
tilage. The former group also described similar fractions isolated from 
human intervertebral disc. The separation methods used by the two 
groups were quite different. Ayad et al. (1981, 1982) found that type IX 
fragments were soluble in 20 mM Na HPO, at pH 9.2, whereas type 
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XI collagen (la2a3a collagen) was precipitated. Ricard-Blum et al. 
(1982), however, used a high NaCl concentration in acetic acid to sepa- 
rate the type IX fragments from type II and type XI collagens which 
precipitated at lower salt concentrations. The high-molecular-weight 
component (CPS 1), as described by Ayad et al. (1981), was of M, 
110,000 before reduction and M, 33,000 after reduction and, therefore, 
closely resembled the M fragment previously described by 
Shimokomaki et al. (1980). A low-molecular-weight component called 
CPS 2 (Ayad et al., 1982) was subsequently isolated and separated 
after reduction into three components of M, 16,000, 10,000, and 8,000. 
The same fragments were observed by Ricard-Blum et al. (1982) and 
called X5, X6 and X7 respectively. This latter group also obtained a 
number of higher molecular-weight fragments, which were called X1 
to X4. 

A model for the arrangement of the various pepsin-resistant frag- 
ments of bovine type IX collagen was proposed by Ricard-Blum et al. 
(1985). This model supports the view that the structural organization 
of mammalian type IX collagen is quite similar to the avian one to be 
described below, except for some differences in the location of the 
pepsin-sensitive sites. 

Ayad et al. (1981, 1982) also presented electron micrographs of seg- 
ment long spacing (SLS) crystallites of the CPS 1 and CPS 2 frag- 
ments. As expected from the molecular weight of these fragments, the 
crystallites were about one-third of the length of those obtained for 
type I collagen for CPS 1 and one-seventh for CPS 2. 


B. Chicken 


The pepsin fragments of chicken type IX collagen were described in 
previous publications as the high-molecular-weight components 
(HMW) and low-molecular-weight components (LMW) (Reese and 
Mayne, 1981; Reese et al., 1982; Bruckner et al., 1983: Mayne et al., 
1985, 1986; van der Rest et al., 1985), or as M1 and M2 (von der Mark et 
al., 1982). For the sake of consistency, we will use the former nomen- 
clature for the rest of this chapter. 

HMW and LMW were prepared by differential salt precipitation in 
0.5 M acetic acid from pepsin-solubilized cartilage collagen. The type IX 
fragments were precipitated at 2.0 M NaCl after precipitation of type I 
and type XI (1a2«3«) collagen at 0.9 M NaCl and 1.2 M NaCl, respec- 
tively. During agarose gel filtration, HMW eluted as a double peak at 
the location of the ß and a chains of type I collagen, although on SDS- 
PAGE it migrated as a single band between the B and « chains. After 
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reduction, HMW migrated on SDS-PAGE as three components, called 
C1,C2, and C3 (with M,s of 87,500, 51,000, and 36,400, respectively) as 
determined by agarose gel filtration (Reese and Mayne, 1981). Subse- 
quently, a fourth component called C4 (M,, 14,000) was shown to partici- 
pate in the assembly of HMW, although it is not disulfide bonded to the 
other components (Reese et al., 1982). It was found that C4 will remain 
bound to the HMW fragment on agarose gel filtration in 1 M CaCl, 
provided the sample is not heat denatured prior to application to the 
column (see Fig. 3 for a model of HMW which shows the location of C4). 
Examination of HMW by rotary shadowing showed a prominent kink in 
the molecule giving rise to both a short and long arm (Reese etal., 1982; 
von der Mark et al., 1982). Removal of the C4 component by denatura- 
tion altered the rotary shadowing image of HMW, causing the short 
arm not to be visualized. This result indicates that C4 forms part of the 
triple-helical structure of the short arm (Reese et al., 1982). In these 
conditions the long arm renatures due to the presence of interchain 
disulfide bridges but, with the loss of C4, the short arm cannot renature 
and is not visualized by rotary shadowing. In later experiments, the C2 
component was resolved by reversed-phase high-performance liquid 
chromatography (HPLC) into equimolar amounts of two chains, which 
were shown to represent different primary structures by fingerprint- 
ing of their tryptic fragments (van der Rest et al., 1985) and by partial 
amino acid sequence analysis [van der Rest et al. (1985) and un- 
published observations]. The two chains were called C2 and C5 and are 
incorporated into the model of HMW as shown in Fig. 3. The C1 compo- 
nent probably arises from a nonreducible cross-link between C3 and 
either C2 or C5. 

The low-molecular-weight component LMW was also present in the 
precipitate at 2.0 M NaCl, 0.5 M HAc, and could be separated from 
HMW by agarose gel filtration. LMW, after reduction and alkylation, 


c2 U 
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Fic. 3. Models for the structure of the pepsin-resistant fragments of chicken type IX 
collagen called HMW and LMW. Note that both HMW and LMW contain three different 
chains, and that, for HMW, one of the chains is cleaved giving rise to C3 and C4. This 
occurs at the NC3 domain (see Fig. 1), and in this domain the interchain disulfide 
bridges (S—S) of HMW are probably present. LMW also possesses interchain disulfide 
bridges which are present at the carboxyl-terminus within the NC1 domain (see van der 
Rest et al., 1985). 
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Fic. 4. Carboxymethylcellulose (CM-cellulose) chromatography of the chains of LMW 
obtained after reduction and alkylation followed by agarose gel filtration. The column 
(1.5 x 10 cm) was equilibrated at 42°C with 0.01 M (Na *)sodium citrate. pH 3.6, and 
elution was achieved with a linear gradient of 0.0-0.2 M NaCl at a flow rate of 100 
ml/hr over a total volume of 400 ml. Bars show the four fractions that were pooled for 
further analysis. Reprinted from Mayne e¢ al. (1985) with permission. 


was resolved by CM-cellulose chromatography into four components 
that were called LMW 1 (M, 11,000), LMW 2A (M, 10,600), LMW 2B 
(M, 10,500), and LMW 3 (M, 10,600), as is shown in Fig. 4. The mo- 
lecular weights were determined by agarose gel filtration (Mayne et 
al., 1985). By HPLC fingerprinting, using several proteolytic enzymes, 


LMW 1 


Lew-Ar g-Lys~ Pro~Leur Se r Pr o~Gl y-Me t- Thr-Gl y~Arg-Hy p-Gl y-Pr o~Al a~Gly~Pr o-Hy p-Gly—Pr o-Hy p-Gl y~Al a~Thr- 


LMW 2A 
Gly-Gly-Val-Gly-Ala-Met-Gly-Pro-Hyp-Gly-Pr o-Hyp-Cly-Pro-Hyp-Gly-Pr o~Hy p-Gly-Glu- 

LW 28 

Ala-Lys-Arg-Ala-Ala-Leu-Gl y-Gly-Val-Gly-Ala-Met-Gly-Pro-Hyp-Gly-Pro-Hyp-Gly-Pro- 

Lys-Arg-Ala-Ala-Leu-Gl y-Gly-Val-Gly-Ala-Met-Gly-Pro-Hyp-Gly-Pro-Hyp-Gly-Pro-Hyp 

Im 3 
(Major) 

Ser-Lew-Arg-Arg-Pro-Glur Phe-Gly-Ala-Hyp-Gly-Leu-Hyp-Gly-Arg-Hyp-Gly-Pro-Hyp-Gly- 
LMW 3 
Minor) 


Leu-Ar g-Arg-Pro-Glur Phe-Gly-Ala-Hyp-Gly-Leu-Uyp-Gly-Arg-Hyp-Gly-Pr o-Hyp-Gly- 


Fıc. 5. Amino-terminal amino acid sequence analyses of LMW 1, 2A, 2B, and 3. For 
LMW 2B, two sequences were recognized in approximately equal proportions. Both 
contain additional lysine and arginine residues that are not present in the sequence of 
LMW 2A. This probably explains the difference in elution of LMW 2A and LMW 2B 
during CM-cellulose chromatography. Reprinted from Mayne et al. (1986) with permis- 
sion. 
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it was shown that LMW 2A and 2B are different cleavage products of 
the same parent chain. This was confirmed by N-terminal amino acid 
sequencing which showed that LMW 2B is five or six residues longer 
than LMW 2A and that the extra residues include a Lys-Arg sequence 
(Fig. 5). This will significantly increase the basic character of the 
fragment and probably explains the delayed retention time of LMW 
2B on CM-cellulose (see Fig. 4). 

LMW, after denaturation but without reduction of interchain dis- 
ulfide bridges, could not be fractionated by either CM-cellulose chro- 
matography (Mayne et al., 1985) or reverse phase-HPLC (M. van der 
Rest and R. Mayne, unpublished observations), suggesting that the 
three chains are present in a single molecular assembly. LMW was 
also visualized after rotary shadowing as a short rod (Mayne and von 
der Mark, 1983) and possessed a single sharp melting curve during 
denaturation (Bruckner et al., 1983). A model for the structure of 
LMW is also shown in Fig. 3. 


Il. THe Intact Type IX MOLECULE 
A. The Isolation of Intact Type IX Collagen 


1. CHEMICAL CHARACTERIZATION 


The intact form of type IX collagen is often referred to as the precur- 
sor form of type IX collagen, although it now seems likely that the 
molecule is not processed extracellularly (von der Mark et al., 1984; 
Kielty et al., 1985). It was initially reported to consist of two chains 
with apparent M, 68,000 and 84,000 (Bruckner et al., 1983; von der 
Mark et al., 1984), which correspond to the H and J chains earlier 
described by Gibson et al. (1983). 

In the above experiments, intact type IX collagen was isolated from 
chondrocyte cell culture or cartilage organ cultures as a radiolabeled 
molecule. It can be either precipitated from the medium of cell 
cultures with 30% ammonium sulfate (Bruckner et al., 1983; Gibson et 
al., 1983) or extracted from the pelleted chondrocytes with 1 M NaCl 
(von der Mark et al., 1984). Bruckner et al. (1983) further purified 
intact type IX collagen by immunoprecipitation with a polyclonal anti- 
body against HMW. These authors also showed, by immunoblotting, 
that the antibody specifically reacted with the M, 84,000 band and, to 
a lower extent, with the M, 68,000 band. The immunoprecipitated 
material also showed, after reduction, sharp bands at M, 84,000 and 
M, 68,000 in addition to a fuzzy band at M, 115,000 which was not 
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characterized at the time. von der Mark et al. (1984) obtained similar 
results for a preparation of intact type IX collagen (called pM) purified 
by DEAE-cellulose chromatography. In addition, they showed that 
bands of the same size as the HMW and LMW components (M1 and M2 
in their nomenclature) could be generated by pepsin digestion of the 
intact molecule. This result demonstrated for the first time that HMW 
and LMW are both present in a single parent molecule. 

A third chain, migrating as a fuzzy band at M, 115,000 on gel elec- 
trophoresis, was largely overlooked in these early reports. Bruckner et 
al. (1985), using a preparation purified by DEAE-cellulose chro- 
matography, showed that intact unreduced type IX collagen could be 
labeled with [3°S]sulfate (see Fig. 6). After reduction, radioactivity 
was present only in the fuzzy band at M, 115,000. The °5S label was 
removed by both chondroitinase ABC and bacterial collagenase treat- 
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Fic. 6. Fluorogram prepared after electrophoresis of radioactively labeled type IX 
collagen in 6% polyacrylamide in sodium dodecyl sulfate. Type IX collagen was initially 
purified from other collagens by DEAE-cellulose chromatography. Lane 1, Type I col- 
lagen standard. Lanes 2-6, {!4C]Glycine-labeled protein. Lanes 7-11, |25S]Sulfate-la- 
beled protein. Lanes 2 and 7, Unreduced control samples. Lanes 4 and 9, Reduced control 
samples. Lanes 3 and 8, Unreduced samples digested with chondroitinase ABC. Lanes 5 
and 10, Reduced samples digested with chondroitinase ABC. Lanes 6 and 11, Reduced 
samples digested with bacterial collagenase. Reprinted from Bruckner ei al. (1985) with 
permission. 
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ment. With a [!*C]glycine label, the M, 115,000 band disappeared 
after chondroitinase digestion while the M, 68,000 band increased in 
intensity (compare lanes 4 and 5, Fig. 6). These results suggest that 
sulfated glycosaminoglycan chains are present in type IX collagen and 
are specifically located in the M, 115,000 band. These results will be 
further discussed in Section IILC. 


2. ROTARY SHADOWING OBSERVATIONS 


Rotary shadowing was also used to recognize the intact form of type 
IX collagen as secreted by a suspension culture of embryonic chick 
chondrocytes (Irwin et al., 1985). A gallery of these molecules is shown 
in Fig. 2. The type IX collagen molecules were originally recognized in 
the preparation by inclusion before rotary shadowing of a monoclonal 
antibody prepared against the HMW fragment. The location of the 
epitope for this antibody was at the end of the long arm of HMW as 
shown by rotary shadowing observations. The antibody was observed 
to bind to the intact type IX molecules at the expected location of the 
HMW fragment [Irwin et al. (1985) and see below]. 

In separate experiments, a large form of type IX collagen (M, 
225,000) was isolated from rat chondrosarcoma (Duance et al., 1984), 
and its relationship with the pepsin fragment previously called type M 
collagen by Shimokomaki et al. (1980) was demonstrated by immu- 
noassay using a polyclonal antibody raised against the type M frag- 
ment. The large form of type IX collagen was examined by rotary 
shadowing, and a prominent kink was observed in some of the mole- 
cules. The knob at the amino-terminus was not, however, visualized. 
This result suggests that the amino-terminus may have been lost ei- 
ther by cleavage within the tumor matrix or during the isolation 
procedures. 


B. Comparison between Protein and cDNA Sequences 


Most of the detailed structure of type IX collagen was unraveled 
from the sequencing of cDNA clones encoding portions of two of the 
constituent chains and by comparison of these sequences with selected 
amino acid sequences of pepsin-resistant fragments (see Fig. 7). The 
first cDNA studied was called pYN 1738 (Ninomiya and Olsen, 1984). 
It was prepared from a chicken cartilage cDNA library, and its charac- 
terization and isolation are discussed in more detail by Vasios et al. 
(this volume). The structure of the intact type IX molecule, as derived 
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Fic. 7. A model for the structure of type IX collagen showing the relationship between 
the pepsin fragments HMW and LMW and the three chains of the intact molecule. The 
regions for which the amino acid sequences were derived from cDNA sequences are 
shown by heavy lines. The regions for which the amino acid sequences were established 
by protein sequencing are indicated by dark boxes. Light lines indicate regions of un- 
known sequences. 


from the nucleotide sequence of this cDNA, was compatible with a 
model which contained both the HMW and LMW fragments 
(Ninomiya and Olsen, 1984). All of the HMW and LMW fragments 
were subjected to partial N-terminal amino acid sequence analysis. 
The sequences of the HMW C2 and of the LMW 3 chains were found to 
correspond to sequences present in the protein encoded for by the nu- 
cleotide sequence of pYN 1738. The only exception was for the prolines 
in the Y position of the Gly-X-Y triplets which were posttransla- 
tionally modified to hydroxyprolines (van der Rest et al., 1985). In 
addition, all of the pepsin fragments of HMW and LMW were digested 
with trypsin and fingerprinted by HPLC. The amino acid composition 
of all of the peaks of these peptide maps was determined. These com- 
positions were compared with the compositions predicted for the tryp- 
tic fragments from the cDNA sequence of pYN 1738. All of the cDNA- 
predicted tryptic peptides were recognized in the tryptic digest of 
LMW 3, except for the last nine residues. These were probably re- 
moved during pepsin digestion. Because of the larger size of the HMW 
fragments, the tryptic fingerprint of HMW after HPLC was more com- 
plex. However, seven tryptic peptides of HMW C2 could be unam- 
biguously identified with peptides predicted from the sequence of pYN 
1738. The chain encoded for by pYN 1738 was called the a1(IX) chain 
(van der Rest et al., 1985). and it was concluded that it contains HMW 
C2 and LMW 3 as shown in Fig. 7. 

A second cDNA clone, pYN 1731, was subsequently described which 
partially encodes for the «2(IX) chain (Ninomiya et al., 1985). In this 
case, however, the cDNA did not extend to cover the N-terminus of the 
HMW fragments. A large tryptic fragment of HMW C3 was nev- 
ertheless recognized which, from its amino acid composition, appeared 
to be encoded for in the sequence of pYN 1731. A partial N-terminal 
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amino acid sequence of this tryptic peptide was obtained and con- 
firmed this identification. The LMW 2 fragment was also shown to be 
encoded for in pYN 1731 both by partial amino acid sequence analysis 
and by amino acid composition of the tryptic fragments. It was con- 
cluded that the a2(IX) chain is comprised of the HMW fragments C4 
and C3 and the LMW 2 fragment (see Fig. 7). 

No cDNA has yet been described for the «3(IX) chain. By elimina- 
tion, it can be inferred that it contains HMW C5 and LMW 1. 


C. Type IX Collagen Is a Proteoglycan 


Noro et al. (1983) first isolated a collagenase-sensitive, disulfide- 
bonded proteoglycan from chick embryo epiphyseal cartilage. It was 
named PG-Lt. Subsequently, this proteoglycan was shown to be the 
same as type IX collagen, based on comparable chromatographic be- 
havior and immunoblotting experiments using a polyclonal antibody 
prepared against the HMW component of type IX collagen (Vaughan et 
al., 1985). For these latter experiments, the same preparation method 
as Noro et al. (1983) was used, except that chicken sterna were used as 
a source of cartilage. There are, however, some differences between 
the two reports. One difference is strictly methodological and refers to 
the molecular weights as determined by gel electrophoresis. In the 
publication of Noro et al. (1983), globular proteins were used as stan- 
dards while Vaughan et al. (1985) used collagen standards. If the dif- 
ferences in mobilities between collagen and globular proteins are 
taken into account, then the two reports are completely comparable. A 
second difference stems from the apparent absence of the M, 68,000 
band in the paper of Noro et al. (1983). This is probably explained by 
the use of serine for protein labeling, resulting in very weak inten- 
sities of the bands after fluorography. The M, 68,000 band might then 
have been missed. Finally, Noro et al. (1983) suggested that PG-Lt has 
a high content of dermatan sulfate, while Vaughan et al. (1985) found 
that the glycosaminoglycan chains were fully digested to disac- 
charides by chondroitinase AC, suggesting the absence of dermatan 
sulfate. The different cartilages that were used by the two groups may 
explain this discrepancy. Noro et al. (1983) used the epiphyses of em- 
bryonic chick tibia while Vaughan et al. (1985) used chicken sterna. 
Both groups found a ratio of four- to six-sulfated disaccharides of 3:1, 
which is much higher than the 1:3 ratio in the aggregating proteogly- 
can of chick hyaline cartilage. 

The work of Vaughan et al. (1985), as described above, was comple- 
mented by another report from the same group [Bruckner et al. 
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(1985)]. In both papers, the existence in the intact type IX collagen 
molecule of a third chain of M, 115,000 associated with glycosamino- 
glycan was demonstrated. These two reports establish that one or per- 
haps two chondroitin sulfate chains are covalently linked with a type 
IX chain. This is based on [?°S]sulfate labeling of the M, 115,000 chain 
and removal of the label by chondroitinase ABC or AC digestion (see 
Fig. 6). Subsequently, the glycosaminoglycans were shown to be ex- 
clusively linked to the a2(IX) chain. By DEAE-Trisacryl chromatogra- 
phy, Huber et al. (1986) separated the three chains oftype IX collagen 
(Fig. 8). By comparison of the tryptic fingerprints obtained for the 
pepsin fragments C3 and C4 and the isolated, intact, sulfated chain 
(Fig. 8, peak 3), it was concluded that the glycosaminoglycan-carrying 
chain is a2(IX) (Fig. 9). By the same technique, the M, 84,000 band 
(Fig. 8, peak 2) was identified as «1(IX) and the M, 68,000 band (Fig. 8, 
peak 1) as «3(IX) (Huber et al., 1986). These results were independent- 
ly confirmed by Konomi et al. (1986), who used polyclonal antibodies 
raised against synthetic peptides that correspond to known amino acid 
sequences for the three chains. These authors demonstrated, by immu- 
noblotting, that there was specific labeling of the M, 115,000 band 
with an antibody raised against a synthetic peptide from the a2(IX) 
chain (Fig. 10). However, using this antibody, immunoblotting was 
only successful if the migration of the M, 115,000 band was reduced to 
M, 68,000 by chondroitinase ABC treatment (Fig. 10, lanes 4 and 5). 
Similarly, the M, 84,000 band was specifically labeled with an anti- 
body against a synthetic peptide for the «1(IX) chain (Fig. 10, lanes 2 
and 3) and the M, 68,000 band with an antibody against a synthetic 
peptide for the a3(IX) chain (Fig. 10, lanes 6 and 7). 

These results show that chondroitin sulfates are only present on the 
a2(IX) chain and provide further evidence that the three chains of type 
IX collagen are part of the same triple helix. This is demonstrated by 
the observation that intact and unreduced type IX migrates on SDS- 
PAGE as a single band whose molecular weight is reduced by chon- 
droitinase treatment (see Fig. 6). If a1(IX), or a3(IX), existed, these 
molecules should be insensitive to chondroitinase treatment. 


Fic. 8. Separation of the three chains of type IX collagen. (A) The reduced and alky- 
lated chains were chromatographed on a DEAE-Trisacryl column in 7.5 M urea at 40°C. 
The «3(IX) chain (peak 1) eluted with the starting buffer, followed by the al(IX) chain 
(peak 2) and the «2(IX) chain (peak 3). Fractions, pooled as shown by bars, were ana- 
lyzed by SDS-PAGE and subsequent fluorography (B). Lanes a and b, Purified type IX 
collagen nonreduced and reduced. Lanes c, d, and e, Aliquots from peaks 1, 2, and 3, 
respectively. Reprinted from Huber et al. (1986) with permission. 
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Fic. 9. Tryptic peptide profiles of an equimolar mixture of the C3 and C4 components 
of HMW (solid line) and of the radiolabeled 115 kDa chain (Fig. 8, peak 3) of intact type 
IX collagen (broken line). The tryptic digestions were performed separately. The diges- 
tion products were mixed and chromatographed together on a C18 Vydac TP 201 column, 
eluted in 9 mM trifluoroacetic acid using an aqueous acetonitrile gradient (0-32%). 
Fractions were collected at 1 min for radioactivity measurements. 


The precise attachment site for the glycosaminoglycan side chain is 
not known at present, nor has the presence of a specific linkage region 
to serine residues been demonstrated. Based on primary structure 
data, the NC1 and NC2 domains of the a2(IX) chain are probably 
excluded, as they do not contain serine residues (Ninomiya et al., 
1985). The three collagenous domains can also be excluded, as the 
pepsin fragments HMW and LMW do not contain glycosaminoglycan 
chains. The possible location of the glycosaminoglycan side chain at 
the NC3 and/or at the NC4 domain is presently actively being investi- 
gated. Preliminary results (Irwin and Mayne, 1986) were obtained by 
rotary shadowing of type IX collagen in the presence of monoclonal 
antibodies 9A2 and 2B6 which recognize stubs of chondroitin 4-sulfate 
or dermatan sulfate generated after chondroitinase ABC digestion 
(Caterson et al., 1985). Specific binding of both of these antibodies to 
the NC3 domain was observed, and this binding was dependent on 
prior digestion with chondroitinase. These results, therefore, indicate 
that the chondroitin sulfate may be located at the NC3 domain and 
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Fic. 10. Immunoblot of type IX collagen with antibodies prepared against synthetic 
peptides derived from nucleotide or amino acid sequences of the al(IX), «2(IX), and 
a3(IX) chains. Lane 1 shows the result of staining the gel with Coomassie Brilliant Blue. 
Arrows indicate the location of the M, 84,000 and M, 68,000 bands (collagenous stan- 
dards). Lanes 2—7 show the result of staining nitrocellulose blots with peptide-specific 
antibodies. Note that with antibodies against «1(IX) (lanes 2 and 3) or «3(X) (lanes 6 and 
7) the location of the band remained unchanged after chondroitinase ABC digestion (+). 
For the a2(IX) chain, immunoblotting of a M, 68,000 band only occurred after 
chondroitinase digestion (lanes 4 and 5). Without chondroitinase ABC digestion (—), the 
«2(IX) chain appears as a broad band of average M, 115,000 (see Fig. 6, lane 9); this may 
explain the failure to obtain a successful immunoblot. Reprinted from Konomi et al. 
(1986) with permission. 


must be present on a small peptide which is liberated from HMW 
during pepsin cleavage between C3 and C4. 

Vaughan et al. (1985) demonstrated incorporation of [?H]mannose 
into type IX collagen. This incorporation was not restricted to a single 
chain, and both the M, 84,000 [a1(IX)] chain and M, 68,000 [a3(IX)] 
chain were labeled. Whether the M, 115,000 [w2(IX)] chain was also 
labeled is not clear from their results. Therefore, type IX collagen also 
apparently possesses mannose-containing oligosaccharide units. 


IV. THE STRUCTURE or TYPE IX COLLAGEN 


A. Nomenclature 


The type IX collagen molecule is now clearly shown to be composed 
of seven domains that are not always of equal length in the three 
chains. Their triple-helical domains must, however, be in register to 
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give the observed helical stability. In order to facilitate the identifica- 
tion of individual amino acids while permitting easy comparison be- 
tween the three chains, we will use the following nomenclature: (1) 
amino acid position in a domain from the amino-terminus, (2) domain 
name, and (3) chain number. For example, a cysteine residue is found 
at position 5-NC1-a1(IX). This translates as the fifth amino acid start- 
ing from the amino end of the NC1 domain of the «al(IX) chain. In 
earlier publications, a different nomenclature was used: the number- 
ing of amino acids referring to their position in cDNA pYN 1738 
(Ninomiya and Olsen, 1984). 


B. The Seven Domains of Type IX Collagen 


1. Tue NC1 Domain 


The NC1 domain is very short, consisting of 21 residues for a1(IX) 
and 15 residues for «2(IX), as determined from cDNA sequence analy- 
sis of the clones pYN 1738 (van der Rest et al., 1985) and pYN 1731 
(Ninomiya et al., 1985). Comparison of these results with structural 
analyses performed on pepsin fragments indicates that very little, if 
any, of the NC1 domain of the al(IX) and a2(IX) chains is removed by 
processing (van der Rest et al., 1985). Indeed, the presence of all of the 
residues except the last nine amino acids was demonstrated in the 
pepsin fragments of the LMW 3 fragment of the al(IX) chain. The 
exact size of the NC1 domain of a3(IX) is not known at present, as a 
cDNA has not been identified for this chain. The NC1 domain of both 
a1(IX) and «2(IX) contains one cysteine residue at position 5-NC1. The 
a3(IX) NC1 domain contains a cysteine at the same position as demon- 
strated by amino acid sequencing of a tryptic peptide containing the C- 
terminal portion of COL1 and the pepsin resistant portion of NC1 (M. 
van der Rest, unpublished results). In the three chains, another cys- 
teine is found in the Y position of the last triplet of the COL1 domain, 
so that the three chains may be held together by pairs of interchain 
disulfide bridges. 


2. THe COLI Domain 


The COL1 domain is 115 residues long for both the al(IX) and 
a2(IX) chains. It contains two imperfections in the triple helix. The 
first one is a Gly-X-Gly imperfection, with X being Arg, located at 
residues 61-COL1 to 63-COL1 for both a1(IX) and a2(IX). The second 
imperfection, Gly-X-Y-X-Y in both chains, is located at residues 78- 
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COL1 to 82-COL1 in «1(IX) and at residues 81-COL1 to 85-COL1 in 
a2(IX) (Ninomiya et al., 1985). The 3-residue shift in the location of 
this imperfection may have a direct effect on the conformation of the 
native molecule at this site. However, further study of this conforma- 
tional effect will require the sequence of the «3(IX) chain in the same 
region. This sequence is not yet available. The last amino acid of the 
COL1 domain is a cysteine in a1(IX), &2(IX), and a3(IX). The implica- 
tion of this result in terms of disulfide-bond formation will be dis- 
cussed in Section IV,C. 

Since the COL1 domain has been the most extensively studied at the 
protein level, it is possible to give some information on the extent of 
posttranslationl modifications. In al(IX), most prolines and all 
lysines in the Y position are hydroxylated. We also have evidence for 
only partial hydroxylation of at least one proline residue, since a tryp- 
tic fragment [37 in van der Rest et al. (1985)] was found in two peaks 
eluting 3 min apart on HPLC and differing only in the hydroxylation 
of one proline (M. van der Rest, unpublished observations). Glycosyla- 
tion of hydroxylysine residues has not been directly demonstrated. 
However, the resistance to trypsin of the peptide bond next to hydrox- 
ylysines 36-COL1-a1(IX) and 48-COL1-a1(IX) and of hydroxylysines 
48-COL1-a2(IX) and 54-COL1-a2(IX) indicates that they are glycosy- 
lated. No analysis for 3-hydroxyproline has been reported yet. 


3. THe NC2 Domain 


The NC2 domain is 30 residues long in both a1(IX) and «2(IX). The 
size and structure of the NC2-a3(IX) domain is not known. In addition 
to probably providing a limited flexibility to the molecule, and also 
being cleaved by pepsin, this domain has no remarkable property. 
Sequencing of cDNA clones shows that there is a cysteine residue at 
position 10-NC2-a1(IX) with no equivalent in NC2-a2(IX). The three 
chains are, thus, not covalently held together at this domain. Since 
most of the NC2 domain is cleaved by pepsin, very few data are avail- 
able for this domain at the protein level. 


4. THe COL2 Domain 


The COL2 domain of a1(IX) is 339 residues long and contains no 
imperfections in the triple helix. Its primary structure was deduced 
from the sequence of the cDNA clone pYN 1738 (Ninomiya and Olsen, 
1984). Several tryptic fragments from the pepsin-resistant fragment 
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HMW C2, which covers COL2, NC3, and COL3, were purified and 
analyzed by amino acid composition (van der Rest et al., 1985). As was 
previously observed for the COL1 domain, these data showed the pres- 
ence of the usual posttranslational hydroxylations of proline and 
lysine residues. A partial sequence of COL2 from «2(IX) was derived 
from a cDNA, pYN 1731, which covers the last 117 residues of the 
domain (Ninomiya et al., 1985). N-terminal amino acid sequence anal- 
ysis of a large tryptic fragment starting at residue 255-COL2-a2(IX) 
was obtained and confirmed the identity of the pepsin-resistant frag- 
ment containing COL2 (HMW C3) as being derived from the a2(IX) 
chain. 


5. THe NC3 DOMAIN 


The NC3 domains of the a1(IX) and a3(IX) chains are resistant to 
proteolysis by pepsin while this domain in the «2(IX) chain is sensitive 
to pepsin, resulting in a cut between COL2 and COL3 of a2(IX) (HMW 
C3 and C4) after pepsin solubilization (Reese et al., 1982) (Fig. 3). The 
primary structure of NC3 of a1(IX) was derived from cDNA sequence 
analysis (Ninomiya and Olsen, 1984; van der Rest et al., 1985). It is 12 
residues long and contains two cysteine residues at positions 6-NC3 
and 10-NC3. The structure of the NC3 domain of «2(IX) is presently 
actively being investigated (McCormick et al., 1987), since immunolog- 
ical evidence (described above) suggests that it is the attachment site 
of the glycosaminoglycan side chain (Irwin and Mayne, 1986). Except 
that it is resistant to proteolysis by pepsin, nothing is known of the 
structure of NC3 from a3(IX). 


6. THE COL3 DOMAIN 


The COL3 domain of a1(IX) is 137 residues long and contains one 
Gly-X-Gly imperfection at residues 87-COL3 to 89-COL3, where X is 
Lys. The COL3 domain of «2(IX) is contained in the HMW C4 pepsin 
fragment (Reese et al., 1982). It is not disulfide linked to the rest of 
HMW and is characterized by a very high hydroxyproline content (161 
residues/1000) (Reese et al., 1982). The cDNA-derived sequence of the 
COL1 domain of a1(IX) contains 19 prolines in position Y, which are 
likely sites of hydroxylation (Ninomiya and Olsen, 1984). This repre- 
sents 138 residues per 1000. This high hydroxyproline content would 
be expected to increase the stability of the triple helix of the COL3 
domain. This is probably reflected by the complex melting curve of 
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HMW, which contains both the COL2 and COL3 domains and only a 
part of which melts at a high temperature (Bruckner et al., 1983). In 
addition, it also explains why HMW C4 remains linked to the other 
disulfide-bonded fragments in 1 M CaCl, during agarose gel filtration 
at room temperature (Reese et al., 1982). Whether this increased sta- 
bility of COL3 has any biological significance is, at present, unknown. 


7. THE NC4 DOMAIN 


Complete information on the size and structure of NC4 is still lack- 
ing both at the DNA and protein levels. The cDNA pYN 1738 only 
covers 85 residues and contains one cysteine residue. The a1(IX) chain 
has an apparent molecular weight of 84,000, while chondroitinase- 
treated &2(IX) is only 68,000, almost co-migrating with «3(IX) (Huber 
et al., 1986; Konomi et al., 1986). Since they do not differ significantly 
in size in the other regions, it is likely that the difference in molecular 
weight is due to an extended NC4 domain in al1(IX). This is in keeping 
with the differences in sizes of the mRNAs of a1(IX) and a2(IX) 
(Ninomiya et al., 1985). The NC4 domain of a1(IX) was recently pu- 
rified from a collagenase digest of chicken embryo sternal cartilage 
(Vasios et al., 1987). Attempts to obtain an amino acid sequence of this 
peptide were unsuccessful, suggesting that the N-terminus may be 
blocked. However, sequence analysis of tryptic fragments confirmed 
the identity of the purified fragment when comparison was made with 
the sequence of an al(IX) genomic DNA clone (Vasios et al., 1987). The 
NC4 domain of a1(IX) behaves as a very hydrophobic entity and has 
an apparent M, of 23,000 on SDS-PAGE. It may contain intrachain 
disulfide bridges but there is no evidence for interchain disulfide 
bridges between a1(IX) and the other two chains in this region. 


C. Disulfide Bridges in Type IX Collagen 


Type IX collagen must contain at least two sets of interchain dis- 
ulfide bridges, since both HMW and LMW contain interchain disulfide 
bridges. A pair of cysteines are found at the C-terminal end of the 
molecule at residues 115-COL1 and 5-NC1 in the three chains. They 
must be held together by these bridges since the pepsin-solubilized C- 
terminal fragment (LMW) is obtained as a single, disulfide-bonded 
entity which is cleaved into three chains after reduction (Mayne et al., 
1985). 


214 MICHEL VAN DER REST AND RICHARD MAYNE 


A set of cysteine residues is also found in a1(IX) in the NC3 domain 
(residues 6-NC3-a1(IX) and 10-NC3-a1(IX)]. The «2(IX) chain must be 
linked to a1(IX) by one or both of these cysteines since the only other 
cysteine found in a1(IX) that could contribute to the disulfide bands of 
the HMW pepsin fragment (Reese et al., 1982) is located at position 10- 
NC2-a1(IX) and has no equivalent in a2(IX). 

One remarkable feature of the cysteines at the C-terminus is that, 
in each chain, they are separated by four residues. The way the cys- 
teines are cross-linked is not known. In type III collagen, the two 
cysteines of each chain mark the transition between the helical and 
nonhelical domain and are in adjacent positions so that the bonds can 
go from a cysteine occupying the Y position of the last triplet in one 
chain to the cysteine occupying the first nonhelical position of an 
adjacent chain (Glanville et al., 1976). The types of conformations 
possible with the much more distant cysteines of type IX collagen 
remain to be established. A complex pattern of disulfide bonds can also 
be expected in the NC3 domain. 

The NC4 domain of a1(IX) may contain intrachain disulfide bonds, 
since its apparent molecular weight increases slightly upon reduction 
(Vasios et al., 1987). 


D. Lysine-Derived Cross-Links 


The only direct analysis of lysine-derived cross-links in type IX col- 
lagen was reported by Wu and Eyre (1984) on pepsin fragments iso- 
lated from bovine articular cartilage. The cross-links studied were the 
fluorescent hydroxypyridinium residues. They were found exclusively 
in the high-molecular-weight fraction at a concentration of 0.45/1090 
residues, which is similar to the concentration in type II collagen 
(0.42/1000 residues). There was, however, a difference in the ratio of 
the hydroxylysyl form of pyridinoline to the lysyl form of pyridinoline 
(8:1 in type IX, compared to 100:1 in purified type II collagen). These 
numbers are only representative of the sample analyzed (articular 
cartilage of 2-year-old steers) and are likely to differ for other species, 
tissues, or age groups. Furthermore, it is possible that additional cross- 
links may be present in the pepsin-sensitive regions of the molecule. 

Cross-linking in the chicken HMW pepsin fragment of type IX col- 
lagen is indirectly suggested by the presence of a fragment of M, 
87,500 (C1) after reduction of interchain disulfide bridges. This frag- 
ment is thought to be a cross-linked peptide between C3 and C2 or C5 
(Reese et al., 1982). 
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V. BIOSYNTHESIS AND CLEAVAGE OF THE INTACT 
MOLECULE BY PROTEASES 


The biosynthetic form of type IX collagen appears on SDS-PAGE as 
a single broad band of apparent M, 250,000 (Bruckner et al., 1983, 
1985; Gibson et al., 1983; von der Mark et al., 1984). The apparent 
heterogeneity of the molecule appears to arise from the presence of 
variable amounts of chondroitin sulfate. Kielty et al. (1985) recently 
examined explants of chick embryo cartilage for evidence of extra- 
cellular processing for type IX collagen and were unable to demon- 
strate any reduction in the molecular weight of this collagen before it 
is incorporated into the cartilage matrix. 

The susceptibility of the biosynthetic form of chicken type IX col- 
lagen to chymotrypsin was investigated, and two disulfide-bonded ag- 
gregates of M, 210,000 and 43,000 were isolated (Clark and Richards, 
1985). The larger fragment gave rise on SDS-PAGE to a single band 
of M, 61,000 after reduction, suggesting that cleavage by chymotryp- 
sin does not occur at the NC3 domain of any of the three chains (see 
Fig. 7). Another group also observed a similar fragment of M,. 210,000 
after limited pepsin digestion of type IX collagen, but fragments of M, 
140,000, 69,000, and 40,000 were observed after reduction (Yasui et 
al., 1984). This result suggests that mild pepsin digestion may also 
cause internal cleavages in the molecule. Further pepsin digestion 
resulted in a smaller fragment of M, 153,000 which appeared similar 
to HMW. In both of these studies, the fragments generated after pro- 
tease digestion were not completely characterized. However, the re- 
sults are entirely consistent with the model for type IX collagen as 
presented in Fig. 7. 


VI. IMMUNOFLUORESCENT LOCALIZATION OF TYPE IX 
COLLAGEN IN CARTILAGE MATRIX 


Several groups have prepared polycolonal antibodies against the 
pepsin-resistant fragments of type IX collagen isolated from either 
porcine or bovine cartilages, and subsequently used the antibodies to 
investigate the immunofluorescent localization of type IX collagen in 
articular cartilages (Duance et al., 1982: Ricard-Blum et al., 1982) or 
bovine nasal septum (Evans et al., 1983). In general, the results sug- 
gest that type IX collagen is predominantly present in the pericellular 
matrix which immediately surrounds the chondrocytes and may con- 
tain a higher proportion of proteoglycans (Poole et al., 1982). The in- 
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tensity of immunofluorescent staining was markedly increased after 
the use of testicular hyaluronidase digestion to remove “masking” 
proteoglycans, as previous described for type II collagen in chick car- 
tilages (von der Mark et al., 1976). Electron microscopic analyses have 
shown that the pericellular capsule of articular cartilage (Poole et al., 
1984) or the inner territorial matrix of growth-plate cartilage (Eggli et 
al., 1985) consists of a fine “feltlike” meshwork of collagen fibrils 
unlike the wider, cross-banded fibrils of type II collagen found in the 
interterritorial matrix. In one of these studies, an attempt was also 
made to correlate the morphology of cartilage after hyaluronidase di- 
gestion with the observed structure of the pericellular capsule (Poole et 
al., 1985). It was speculated that type IX collagen, and possibly type XI 
(la2a3a) collagen, may be involved in organizing or stabilizing the 
pericellular capsule, or possibly could form the fibrils which make up 
this matrix. An immunoperoxidase procedure was also used to obtain 
electron microscopic localization of type IX collagen in fetal calf epi- 
physeal cartilage (Hartmann et al., 1983). The results suggested a 
preferential localization to the fine fibrils around the chondrocytes 
rather than to the larger cross-banded fibrils of type II collagen. 

In other studies, a monoclonal antibody was prepared against the 
high-molecular-weight fragment of chicken type IX collagen and used 
for immunofluorescent localization of type IX collagen in a variety of 
cartilages (Irwin et al., 1985; Mayne and Irwin. 1986). The antibody 
was of sufficiently high affinity that the location of the epitope along 
the intact type IX molecule could be viewed by rotary shadowing (Ir- 
win et al., 1985). Immunofluorescent staining of embryonic or adult 
chicken sterna with this antibody after hyaluronidase digestion 
showed codistribution with type II collagen throughout the cartilage 
matrix and not specifically in a pericellular location (Irwin et al., 
1985). Similar results were obtained with articular cartilage from 17- 
day chick embryos in which the type IX collagen was present through- 
out the cartilage matrix together with type II collagen, except on the 
articular surfaces where type I collagen predominated (Mayne and 
Irwin, 1986). Immunofluorescent staining of hypertrophic cartilage 
from 17-day-old chick embryo tibiotarsus also showed codistribution of 
type II and type IX collagens (Fig. 11). Similar results confirming the 
codistribution of type II and type IX collagens in several chick car- 
tilages were found with a polyclonal antibody prepared against HMW 
(Müller-Glauser et al., 1986). 

Somewhat different results were observed when sections were not 
treated with hyaluronidase prior to immunofluorescent staining. In 
embryonic chick cartilage, some immunofluorescent staining was ob- 
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Fic. 11. Immunofluorescent staining of the diaphysis of 17-day chick embryo tibiotar- 
sus with monoclonal antibody EB6 to type II collagen (A) and monoclonal antibody 2C2 
to type IX collagen (B). Arrowheads show sites of erosion of chondrocyte lacunae and 
asterisks show the surface of the cartilage. x363. 


served in the center of the cartilage for type IX collagen but not for 
type II collagen (Irwin et al., 1985), suggesting that type IX collagen 
may not be so extensively masked by proteoglycans. Also, in adult 
quail sternum without hyaluronidase digestion, some fluorescent 
staining occurred in a pericellular location for type IX collagen but not 
for type II collagen (Irwin et al.,1985). These results suggest that con- 
siderable caution should be exercised in interpreting immunolocatiza- 
tion experiments with cartilage, especially if the hyaluronidase diges- 
tion is incomplete. Recently, Poole et al. (1985) investigated the remov- 
al of proteoglycans from canine tibial cartilage by hyaluronidase di- 
gestion. Ruthenium red staining showed that digestion for up to 10 hr 
resulted in little change in proteoglycan concentration, and it was 
necessary to digest for 15-20 hr to remove completely all of the pro- 
teoglycan aggregates. These digestion times with hyaluronidase are 
far longer than are customarily employed before immunofluorescent 
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staining. In addition, the pericellular matrix was observed to collapse 
after hyaluronidase digestion so that only a pericellular capsule was 
observed. 


VII. SUMMARY AND CONCLUSIONS 


Type IX collagen differs from the interstitial collagens in several 
important respects. The intact molecule is unusual in possessing three 
collagenous domains, each of which is interspersed by a short, non- 
collagenous domain. One of these noncollagenous domains (NC3) ap- 
pears to be highly flexible when the molecule is visualized by rotary 
shadowing (Irwin et al., 1985). In addition, the size and location of the 
exons in the type IX collagen genes are clearly different when com- 
pared to the fibril-forming collagens (Lozano et al., 1985). The exons 
within the interstitial collagen genes appear to have originated from a 
unit of 54 base pairs [see Kühn, this volume; Kühn (1984)], but for 
type IX collagen no evidence was found for such a repeated structure 
(Lozano et al., 1985). Type IX collagen may, therefore, be unrelated or 
only distantly related to the fibril-forming collagens, and it seems 
unlikely that it will be able to form collagen fibrils of the conventional 
type. It is, however, possible that the molecule may interact with 
fibrils of type II collagen and could be involved in organizing or main- 
taining the 3-dimensional meshwork of fine collagen fibrils found in 
many cartilage matrices. This possibility has been suggested several 
times (Wu and Eyre, 1984; Bruckner et al., 1985: Mayne et al., 1985), 
and recently some electron microscopic evidence was presented to sup- 
port such a potential function (Müller-Glauser et al., 1986). In car- 
tilage matrices, type IX collagen is always present with type II col- 
lagen, and both are present in bovine vitreous humor, which also 
contains a fine meshwork of collagen fibrils (Ayad and Weiss, 1984). 
The synthesis of type IX collagen is also apparently initiated at the 
same time as the synthesis of type II collagen during chondrogenesis 
in chick limb buds (Kimura et al., 1985). 

The apparent preferential immunolocalization oftype IX collagen to 
a pericellular location in several mammalian cartilages (Duance et al., 
1982; Ricard-Blum et al., 1982) might also suggest an involvement in 
organizing a 3-dimensional meshwork. Only in the pericellular region 
of the epiphyseal cartilage of adult mammals is a fine meshwork of 
collagen fibrils formed, whereas in the territorial and interterritorial 
matrices the collagen fibrils are larger and not so extensively orga- 
nized (Poole et al., 1985; Eggli et al., 1985). In addition, the identifica- 
tion of the trivalent hydroxypyridinium cross-link in some of the type 
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IX collagen molecules of bovine articular cartilage (Wu and Eyre, 
1984) suggests a potential means of cross-linking type IX collagen to 
type II collagens. However, a population of type IX collagen in chicken 
cartilage matrix is clearly not cross-linked in this manner and can be 
solubilized from cartilage matrices in 4 M guanidine HCl (Noro et al., 
1983; Wu and Eyre, 1984; R. Mayne, unpublished observations). 

It was initially estimated that type IX collagen constitutes only 5% 
of the collagen that can be solubilized from the chicken sternum with 
pepsin (Reese and Mayne, 1981). However, recent biosynthetic experi- 
ments with embryonic chick cartilages suggest that the synthesis of 
type IX collagen relative to type II collagen may be more than pre- 
viously estimated and, during biosynthesis, could constitute as much 
as 20% of the synthesis of type II collagen (Yasui et al., 1984; Kielty et 
al., 1985; Vaughan et al., 1987). Therefore, it seems highly likely that 
type IX collagen will play important functions in the assembly and 
structure of the extracellular matrix of cartilage. 
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I. INTRODUCTION 


Type X collagen possesses a number of unique characteristics that 
distinguish it from the other collagens. The molecule has a very re- 
stricted tissue distribution and is synthesized predominantly by hyper- 
trophic chondrocytes during the process of endochondral ossification. 
In the fetal and adolescent skeleton, the molecule is a transient inter- 
mediate in the cartilage which will be replaced by bone. In the adult, 
type X collagen persists in the zone of calcified cartilage which sepa- 
rates the hyaline cartilage from the subchondral bone. 

The molecule is 138 nm in length, approximately half the length of 
the interstitial collagen types I, II, III, and V. Although the triple- 
helical domain is smaller, it has greater thermal stability than type I 
or type II collagens, demonstrating a Tm of 47°C. Type X collagen is 
susceptible to cleavage by vertebrate collagenase, but it is cleaved at 
two sites within its helical domain rather than at a single locus, which 
is characteristic of other collagen types. Another intriguing and possi- 
bly unique characteristic is the structure of its gene, which contains 
only one intron. The genes for other vertebrate collagen types contain 
numerous introns. 


II. INITIAL DISCOVERY 
At approximately the same time, three separate laboratories work- 
ing with cultures of embryonic chick chondrocytes reported the pres- 
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ence of a short collagenous molecule which lacked interchain disulfide 
bonds (Schmid, 1980; Gibson et al., 1981; Capasso et al., 1982). One 
laboratory was studying the differentiation of chondrocytes from 
three different zones isolated from the tibiotarsus of 12-day chick 
embryos. The chondrocytes of embryonic long bones differentiate in a 
linear continuum from small proliferative cells (zone 1), to flattened 
cells with high biosynthetic capacity (zone 2), and then to hypertrophic 
cells (zone 3) (Stocum et al., 1979). The objective of the work was to 
determine whether the cells within different zones were biochemically 
different. If so, a second objective was to determine whether mono- 
layer chondrocyte cultures derived from the three specific zones of 
growth cartilage would maintain these zone-specific characteristics in 
culture. A variety of biochemical parameters were evaluated, includ- 
ing the types and rates of proteoglycan synthesis (Kim and Conrad, 
1977, 1980) and the types and rates of collagen synthesis (Schmid and 
Conrad, 1982a,b). Matrix vesicle production was also examined 
(Glaser and Conrad, 1981). 

Short-term organ cultures of tissue from zone 3 (hypertrophic cells) 
synthesized a collagenous protein with chains of lower molecular mass 
than al(ID chains (Schmid and Conrad, 1982a,b). The tissues from 
zone 1 (proliferative cells) and zone 2 (flattened cells) did not synthe- 
size the low-molecular-weight collagen. If the cells from each zone 
were dissociated and grown in monolayer culture for a week, the cells 
originating from both zone 2 and zone 3 now synthesized the small 
collagen molecule. The cells derived from all three zones, including 
zone 1, synthesized the small collagen molecule if the primary 
chondrocytes were passed into secondary cultures and maintained in 
monolayer culture for an additional week. The low-molecular-weight 
collagen appeared to be a specific marker for hypertrophic 
chondrocytes and showed that the chondrocyte monolayer cultures de- 
rived from different zones did not merely maintain their charac- 
teristics in culture but rather continued their program of development. 

The low molecular weight collagen has chains of M, 59,000, as ana- 
lyzed by SDS-PAGE, when compared to collagen standards. It was 
first thought to be a degradation product of a larger collagen molecule, 
but further experimentation failed to reveal evidence of a precursor 
molecule. Analysis of the molecule by cyanogen bromide (CNBr) pep- 
tide mapping showed that it was unrelated to any known collagen. The 
molecule was referred to as 59K collagen or short chain (SC) collagen. 
The low-molecular-weight collagen has now been given the designa- 
tion type X collagen. 

Another laboratory reported the synthesis of type X collagen by 
chick sternal chondrocytes in culture (Gibson et al., 1981, 1982). The 
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chondrocytes were isolated from 18-day sterna and cultured either in 
native collagen gels or directly on tissue culture dishes. The major 
difference between the cultures was the relative amount of a short 
collagen molecule (type X) secreted into the medium. The chondrocytes 
cultured on plastic synthesized the protein in trace amounts, whereas 
the chondrocytes in the collagen gels synthesized the molecule as the 
major [?H]proline-labeled product secreted into the medium. The mole- 
cule was degraded by bacterial collagenase and, by SDS-PAGE, it had 
chains of M, 59,000. By CNBr peptide analysis, it was unrelated to any 
known collagen type. Gibson et al. (1982) referred to the molecule as G 
collagen. 

A third laboratory observed the synthesis of type X collagen while 
studying the stability of embryonic chick chondrocytes in culture (Ca- 
passo et al., 1982). This group isolated chondrocytes from the tibiotar- 
sus of 19-day chick embryos and maintained the cells in culture as 
monolayers or as floating cells. Some of the attached chondrocytes 
were observed to spontaneously detach from the culture dishes, and 
the inclusion of ascorbic acid in the culture medium promoted the 
attachment of the floating chondrocytes. Both the floating and at- 
tached chondrocyte cultures synthesized type II collagen plus a low- 
molecular-weight collagenous protein with chains of M, 64,000 (64K) 
when compared with globular protein standards on SDS-PAGE. The 
64K chains were degraded by bacterial collagenase, and their secretion 
was dependent on the presence of ascorbic acid in the medium. The 
molecule appeared unrelated to type II collagen since it incorporated 
[25S}methionine at a faster rate. 

The above reports emphasize the importance of cell culture in the 
initial discovery and characterization of type X collagen. The molecule 
was probably not recognized by earlier investigators because of its low 
molecular weight, its solubility in acidic salt solutions (described 
later), and its restricted distribution to hypertrophic cartilage. The 
investigators who first observed the synthesis of type X were fortunate 
to have used sources of chondrocytes which were capable of producing 
the molecule and culture conditions which enhanced its synthesis. 
Type X collagen is not restricted to avian cartilage. A similar molecule 
has also been isolated from bovine (Remington et al., 1984; Grant et al., 
1985) and rabbit (Remington et al., 1983; Sussman et al., 1984) growth- 
plate cartilages. 


Ill. Source or TYPE X COLLAGEN 


Chondrocyte cell cultures were the first source of type X collagen, 
and harvesting conditioned medium from cultures continues to be the 
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most useful means of obtaining the intact molecule in quantity. In one 
procedure, chondrocytes are enzymatically released from the tibiotarsi 
of 12-day chick embryos to prepare primary monolayer cultures (Kim 
and Conrad, 1977; Schmid and Conrad, 1982a; Schmid and Linsen- 
mayer, 1983). After 1 week, the cultures reach confluence and are then 
passed into secondary culture. Such expanded mass cultures can be 
maintained and harvested for 6 weeks. There are two advantages to 
this method. The cells will synthesize type X collagen in increasing 
proportions (eventually, it constitutes the major collagen secreted into 
the culture medium), and type X can be harvested daily from the 
culture medium. 

Alternate methods for the isolation of type X collagen involve the 
proteolytic release of type X from cartilage (Kielty et al., 1984; Quarto 
et al., 1985) or the extraction of type X from cartilage which has been 
made lathyritic. Both of these methods suffer from a disadvantage in 
that type X collagen is a minor component of hyaline cartilage. In an 
18-day embryonic chick tibiotarsus, type X collagen represents approx- 
imately 2% of the collagen in the cartilage. Only a small proportion of 
the chondrocytes in cartilaginous tissues are hypertrophic, and even 
within the hypertrophic regions there exist considerable quantities of 
other collagen types (von der Mark and von der Mark, 1977; Schmid 
and Conrad, 1982b; Kielty et al., 1985; Grant et al., 1985). Proteolytic 
extraction has an additional disadvantage in that it destroys the non- 
helical regions of the molecule. 


IV. ISOLATION oF Type X COLLAGEN 


Mass cultures of confluent chondrocyte monolayers (10 150-mm 
plates) yield greater than 50 mg of type X collagen after a month of 
harvesting the medium (Schmid and Linsenmayer, 1983). The type X 
is initially isolated from the culture medium by the addition of 30% 
ammonium sulfate in the presence of protease inhibitors. The precipi- 
tate is recovered by centrifugation, solubilized, and dialyzed against a 
neutral buffer (50 mM Tris, 0.4 M NaCl, pH 7.4). To remove residual 
proteoglycans and serum proteins from the preparation, the 30% am- 
monium sulfate precipitation is repeated. 

The ammonium sulfate precipitate contains several different types 
of procollagen and collagen molecules. The type X collagen can be 
separated from most of the other collagen types by its solubility in 
acidic salt solutions. The types I and II collagens precipitate when 
samples are dialyzed against 0.5 M acetic acid containing 0.7 M NaCl. 
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The supernatant is enriched in type IX, type X, and type XI (la2a3qa) 
collagens. Type XI becomes insoluble when the NaC] concentration is 
raised to 1.2 M; the majority of the type X and type IX do not precipi- 
tate until the NaC] concentration of the acid solution is raised to 2 M. 
In practice, contamination of type X preparations with type XI and 
type IX collagens is slight since most of these molecules remain with 
the chondrocyte cell layer and are not present in the culture medium. 
Therefore, the collagen precipitated from the culture medium between 
1.2 and 2.0 M NaCl in 0.5 M acetic acid is almost exclusively type X 
collagen. If completely pure type X is desired, preparations can be 
further purified by gel filtration or ion exchange chromatography (see 
Section V). 

The use of cell culture as the source of type X collagen without 
resorting to proteolytic extraction demonstrated that the molecule con- 
tains both collagenous and noncollagenous domains (Gibson et al., 
1982; Schmid and Conrad, 1982a). The majority of the protein isolated 
by this method contains three identical chains of M, 59,000. 

The individual molecules were visualized by electron microscopy 
after rotary shadowing with platinum. As shown in Fig. 1A and B, two 
distinct domains of the molecule are evident in such preparations. One 
is a long, thin, rodlike region, characteristic of a collagen triple helix 
(Schmid et al., 1984). The other is a knoblike globular region at one end 
of the helical domain. 

The type X collagen gene has recently been isolated and sequenced 
(see Vasios et al., this volume; Ninomiya et al., 1986). The amino acid 
sequence of the molecule revealed a large nonhelical domain (NC1) of 
170 residues at the carboxy-terminus of the molecule, a helical domain 
of 460 residues, and a small nonhelical domain (NC2) at the amino 
terminus of the molecule. The knob which is observed by rotary shad- 
owing must, therefore, represent the carboxy-terminus of the mole- 
cule. 

Digestion of type X collagen with pepsin removes the globular re- 
gion of the molecule, leaving the triple-helical domain intact (Fig. 2, 
compare lanes 2 and 3). On SDS—PAGE the chains of the pepsin re- 
sistant domain have an M, of 45,000 (using collagen peptides as stan- 
dards). By electron microscopy (Fig. 1C), the length of this domain is 
133 nm. Assuming a length of 0.286 nm per amino acid residue in a 
triple-helical polypeptide (Rich and Crick, 1961) and an M, 98 per 
amino acid residue, the data from the gene sequence, SDS-PAGE, and 
electron microscopy are in agreement. 

The noncollagenous domain at the carboxy-terminus can be isolated 
following digestion of the molecule with bacterial collagenase (Schmid 
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Fic. 1. Rotary-shadowed preparations of type X collagen. (A and B) Micrographs of 
the intact molecule (59K chains). (C) The pepsin-resistant form of type X collagen (45K 
chains). Bar, 100 nm. Reproduced from Schmid et al. (1984), Ultrastruct. Res. 86, 186— 
191, by copyright permission of Academic Press, Orlando, Florida. 
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Fic. 2. SDS-PAGE (12% gel). Lane 1 contains al(II) chains. Lane 2 contains the 
chains of intact type X collagen (59K). Lane 3 contains the pepsin-resistant chains (45K) 
of type X. Lane 4 contains a bacterial collagenase digestion of type X, but the sample 
was not heated before loading the gel. Lane 5 contains a sample which was treated like 
that in lane 4, except the mixture was heated to 100°C before loading the gel. Lane 6 
contains the following protein mixture: bovine serum albumin (67K), ovalbumin (45K), 
soybean trypsin inhibitor (21.5K), and myoglobin (17K). Reprinted with permission 


from Schmid, T. M., and Lisenmayer, T. F. (1984). Biochemistry 23, 553-558. Copyright 
(1984) American Chemical Society. 


and Linsenmayer, 1984). As shown in Fig. 2, lane 5, the products of 
such a digestion contain chains of M, 15,000-20,000. If the SDS- 
PAGE is performed without heating the samples (lane 4), the products 
migrate with an M, of 45,000 (using noncollagenous proteins as stan- 
dards). Therefore, the presence of SDS alone is not sufficient to sepa- 
rate the chains of the noncollagenous domain; both heat and detergent 
are necessary, suggesting the chains are tightly associated through 
strong noncovalent interactions. 


V. CHROMATOGRAPHY OF TYPE X COLLAGEN 


Gel filtration of collagen molecules on Sephacryl S-500 (Schmid and 
Linsenmayer, 1983) or agarose (Quarto et al., 1985) is a convenient 
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method for separating the short type X molecules from small amounts 
of larger collagen molecules like type II. Sephacryl S-500 has certain 
characteristics which make it useful for such purifications. The elu- 
tion position of the type X molecule can be modified to maximize the 
desired separation. The type X molecules, with 59K chains, have a 
slight affinity for the gel, which appears to be due to a hydrophobic 
interaction with the globular domain of the molecule. Thus, this form 
of the molecule elutes from a column slightly after pepsin-treated 
molecules composed of 45K chains. The affinity of the globular do- 
main for the gel can be eliminated if the polarity of the solvent is 
reduced by the addition of 25% ethylene glycol to the buffer. Under 
such conditions, both forms of the molecule elute in the same position. 

Ion exchange chromatography was also used to purify both the 59K 
and 45K chains of type X collagen (Schmid and Linsenmayer, 1983; 
Cancedda et al., 1985). In one method, (carboxymethyl) cellulose was 
equilibrated at 42°C in 20 mM sodium acetate buffer containing 1 M 
urea, pH 4.8, and the two forms of type X eluted with a linear salt 
gradient (Schmid and Linsenmayer, 1984). The 45K chains eluted in 
the salt gradient (conductivity, 9 mmho) between the al(D) and «2(D 
chains, while the 59K chains of type X eluted (conductivity, 12.5 
mmho) after the a2(1). The recoveries from these columns were in the 
range of 30-35%. 

Attempts to use DEAE-cellulose chromatography for the purifica- 
tion of type X have not been successful using chromatographic condi- 
tions which have been successful for the purification of procollagen 
molecules (Linsenmayer and Little, 1978). Type X preparations bound 
so tightly to the resin that the material could not be recovered with 
high salt concentrations or extreme changes in pH (T. M. Schmid and 
T. F. Linsenmayer, unpublished results). 


VI. Amino ACID COMPOSITION OF TYPE X COLLAGEN 


Table I shows a comparison of the amino acid composition of chicken 
type X collagen reported from three different laboratories. In addition, 
the composition as deduced from the nucleotide sequence of the type X 
collagen gene (Ninomiya et al., 1986) is also presented. Comparison 
can therefore be made of the amino acid composition from the se- 
quence data with that of the molecule containing its full complement 
of posttranslational modifications. Examination of the type X helical 
domain, i.e., the pepsin-resistant domain with 45K chains, reveals an 
amino acid composition typical of a collagen triple helix. The molecule 
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TABLE I 
Amino ACID COMPOSITION OF TYPE X COLLAGEN@ 
Residue 45K% 45Ke 45K4 45Ke 59K> 59K°® NC1® 
4-Hyp ND 112 101 126 ND 80 ND 
Asp 9 35 41 39 13 49 24 
Thr 9 11 18 13 27 17 59 
Ser 20 16 30 20 43 38 94 
Glu 46 85 93 85 37 90 24 
Pro 247 136 113 117 186 118 65 
Gly 333 316 312 333 249 259 71 
Ala 46 53 65 58 48 53 53 
Cys 0 0 0 0 6 0 18 
Val 28 25 20 24 40 36 65 
Met 17 23 14 16 30 24 41 
Ile 24 28 22 25 37 31 71 
Leu 59 56 56 48 68 57 88 
Tyr 2 3 2 3 24 25 76 
Phe 13 11 13 12 22 26 35 
Hyl ND 35 31 33 ND 30 ND 
Lys 52 20 26 20 54 25 59 
His 9 7 7 8 12 12 18 
Arg 28 28 37 29 30 30 29 
Gln 42 ND ND ND 46 ND 53 
Asn 17 ND ND ND 25 ND 47 
Trp 0 ND ND ND 3 ND 12 
Gle ND ND ND 29 ND ND ND 
Gal ND ND ND 23 ND ND ND 


aAll values are reported as residues/1000 and are not corrected for loss or incomplete 
release of the amino acids during hydrolysis. Regions of the type X molecule: 59K, entire 
molecule (672 residues); 45K, pepsin resistant domain (460 residues); NC1, nonhelical 
region at carboxyterminus (170 residues); NC2, nonhelical region at amino-terminus (42 
residues); ND, not determined. 

bComposition derived from sequence of the type X gene (Ninomiya et al., 1986) and 
does not include posttranslational modifications of the protein. 

cComposition derived from protein hydrolysate (Schmid and Linsenmayer, 1983). 

dComposition derived from protein hydrolysate (Kielty et al., 1984). 

eComposition derived from protein hydrolysate (Quarto et al., 1985). 


has 33% of its residues as glycine, 25% as imino acids, and high levels 
of hydroxyproline and hydroxylysine. 

The total content of imino acids is higher than that of other col- 
lagens, and this may contribute to the high thermal stability of the 
triple helix (see Section VIII). About half of the proline residues and 
two-thirds of the lysine residues are hydroxylated. Based on the 
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glucose and galactose content of the molecule (Quarto et al., 1985), a 
large proportion of the hydroxylysine residues must be glycosylated. 

The overall amino acid composition of type X collagen is more sim- 
ilar to type IX collagen than to other collagen types. Like type IX 
collagen, the bovine type X molecule contains interchain disulfide 
bonds in the helical and globular domains of the molecule (Remington 
et al., 1984; Grant et al., 1985). However, avian and rabbit (Remington 
et al., 1983; Sussman et al., 1984) type X collagen lack interchain 
disulfide bonds. Type X contains higher levels of imino acids and 
methionine than does type IX. 

A comparison of the amino acid composition of the triple-helical and 
the noncollagenous regions of the type X molecule reveals some strik- 
ing differences. The most dramatic difference is the high content of 
aromatic residues in the noncollagenous regions (NC1 and NC2). Of 
the tyrosine residues, 94% (15) are in these regions, as are 60% (9) of 
the phenylalanine residues, and all of the tryptophan residues (2). 
More than half (12) of the methionine residues are found in the non- 
helical regions, which account for only 25—30% of the total mass. The 
regions are also rich in serine, threonine, valine, isoleucine, and 
leucine. 

From the amino acid sequence derived from analysis of the chick 
type X gene, four cysteine residues are present in the noncollagenous 
regions. One cysteine is in the small NC2 domain at the amino-termi- 
nus of the molecule, while the other three are found in the larger NC1 
domain at the carboxy-terminus. In chick type X, as mentioned above, 
the cysteine residues do not appear to form interchain disulfide bonds, 
but the possibility exists that they may form intrachain bonds. 

It should be pointed out that using the gene sequence to derive the 
amino acid compositions of the different forms of the molecule is com- 
plicated by our lack of knowledge of the potential processing of the 
molecule. A definitive relationship has not been established between 
the translation product of the type X gene and the form in which the 
molecule is secreted and ultimately deposited in the extracellular ma- 
trix. During these processes, there may or may not be small proteolytic 
cleavages of the molecule near its amino- and/or carboxy-terminus. 
Thus, the data derived by the different methods must be interpreted 
and compared with some degree of caution. 


VII. CYANOGEN BROMIDE CLEAVAGE 
OF TYPE X COLLAGEN 


The cleavage of the type X collagen with cyanogen bromide produces 
a large number of peptides, as would be predicted from its high content 
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of methionine. Compared to the peptides generated from type I or type 
II collagen, the CNBr peptides oftype X are all low molecular weight. 
A separation ofthe CNBr peptides oftype X collagen by SDS-PAGE is 
shown in Fig. 3. The majority of the peptides, which are fixed and 
stained in this gradient gel, have molecular weights between 6000 and 
13,000. These results compare favorably with the sizes of the peptides 
predicted from the location of methionine residues in the sequence of 
the molecule (see Vasios et al., this volume, Fig. 5). The largest CNBr 
peptides of type X should have M, values of approximately 12,000, 
8000, and 7000. 

The pattern oftype X collagen CNBr peptides is easily recognizable 
when compared to the peptides of other collagen types, most of which 
are considerably larger (Schmid and Conrad, 1982a; Gibson et al., 
1982; Remington et al., 1984; Bornstein and Sage, 1980). However, the 
relatively large number of peptides which migrate in the low-mo- 
lecular-weight region makes comparisons less useful in complex col- 
lagen mixtures; it also makes the comparison of minor differences in 
type X preparations more difficult. A 2-dimensional separation pro- 
cedure, such as that of Benya (1981), would probably be more effective 
for the detection of differences in type X samples. 
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Fic. 3. CNBr peptides of types I, II, and X collagens separated by SDS-PAGE (8-20% 
gradient gel). CNBr peptides of type I collagen are labeled on the left side of the gel. The 
molecular weights of two collagenous CNBr peptides are shown for comparison on the 
right side of the gel. 
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Other methods of cleaving type X might also be advantageous, such 
as the specific cleavage of the Asn-Gly bonds with hydroxylamine 
(Bornstein and Balian, 1977). From the sequence of the type X mole- 
cule, six bonds should be cleaved, resulting in seven peptides with M,s 
of approximately 18,000, 10,000, 8000, 7000, 6000, and 4000. 


VIII. THERMAL STABILITY OF TYPE X COLLAGEN 


Several methods for the chromatographic separation of other col- 
lagen chains involve the thermal denaturation of the collagen triple 
helix and subsequent chromatography under denaturing conditions. In 
initial attempts to purify the denatured chains of type X collagen, the 
molecule was heated to temperatures greater than 70°C and chro- 
matography was performed on a agarose column (BioGel A-5m) in the 
presence of 1 M CaCl,. The majority of the type X material eluted in a 
position expected for a molecule larger than an al(IJ) chain. This ob- 
servation suggested that individual type X chains were not separated 
by this procedure, which prompted an examination of the thermal 
stability of the type X molecule. 

The thermal stability of the type X triple helix was analyzed by 
circular dichroic (CD) spectropolarimetry. The intact type X molecule 
and its pepsin-resistant helical domain showed CD spectra typical of 
collagenous molecules (Schmid and Linsenmayer, 1984). They both 
exhibited a small peak of positive ellipticity at 222 nm and a very 
strong negative ellipticity at 197 nm. The ellipticity of the samples at 
both. wavelengths was greatly reduced at 55°C, which suggested that 
the collagen had undergone a conformational change from triple helix 
to random coil. 

The temperature-dependent change in the helical conformation was 
examined by monitoring the ellipticity of the samples at 222 nm as 
they were raised to progressively higher temperatures. Both the type 
X molecule and its pepsin-resistant domain exhibited a thermal de- 
naturation midpoint (Tm) of 47°C. Similar determinations on chick 
collagen types I and II generated T,, values of 42-44°C. Thus, the 
triple helix of type X collagen is more stable than that of types I and II. 

The noncollagenous regions of the molecule did not affect the ther- 
mal stability of the triple helix. Their presence, however, did affect the 
ability of the helical portion of the molecule to renature after thermal 
denaturation. If the type X triple helix was completely denatured by a 
30-min incubation at 55°C and then the preparation was rapidly cooled 
to 25°C, the preparation regained greater than 60% of the helical sig- 
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nal within 40 min. An identical experiment performed with the pep- 
sin-resistant domain of the molecule resulted in little, if any, reforma- 
tion of the triple-helical structure. This was true even when a much 
longer time period was allowed for potential renaturation to occur. It 
appears that some aspect of the noncollagenous domains of type X 
collagen is quite refractory to thermal denaturation. Thus, it keeps the 
chains in “register,” facilitating a rapid and correct reformation of the 
helical structure in the adjacent domain. Similar observations on the 
efficiency and completeness of renaturation have been made for pro- 
collagen vs collagen molecules (Hayashi et al., 1979). In type I pro- 
collagen, the molecule has interchain disulfide bonds within its non- 
helical domain at the carboxy-terminus, which account for the 
stability of the molecule. Since interchain disulfide bonds have not 
been detected in chicken type X collagen, noncovalent bonds, probably 
of a hydrophobic nature, may stabilize the nonhelical domains of the 
molecule. As described above, the noncollagenous domains of type X 
are enriched in aromatic and other hydrophobic amino acids. The ther- 
mal stability of the interchain interactions in the noncollagenous re- 
gions and the necessity for both heat and detergent to disrupt the 
interactions are consistent with the nature of hydrophobic interac- 
tions. 

The return of 60% of the helical signal upon cooling could be the 
result of each molecule regaining this percentage of its structure or of 
60% of the molecules regaining their complete helical structure. These 
possibilities were tested by determining the size distribution of the 
renatured molecules. If all molecules renatured to the same extent, 
one would expect them to elute as a single peak. If some had renatured 
completely and others had not, the elution profile should be more 
heterogeneous. The results obtained by gel filtration on Sephacryl 
S-500 (Schmid and Linsenmayer, 1984) showed that the majority of 
the renatured type X molecules migrated to a position indistinguisha- 
ble from that of native molecules, but approximately one-third of the 
material eluted as a smaller species. The smaller molecular-weight 
material, which eluted as a shoulder on the peak of native-sized mole- 
cules, was larger than the size of an individual denatured type X 
chain, suggesting that some structure had reformed. However, they 
had not reformed native molecules. 

Similar results were obtained by immunochemical analyses using a 
monoclonal antibody directed against type X collagen. This antibody 
recognizes a specific triple-helical epitope and will not bind to the 
denatured chains of the molecule. The complete characterization of the 
antibody will be described in Section XII. However, the conformational 
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dependence of this reagent provides a rigorous test of the ability of the 
type X molecule to reform an exact triple-helical conformation (at 
least at the site of the epitope). Denatured type X molecules were 
cooled to 25°C, and the extent of renaturation was determined by em- 
ploying the antibody in a competition enzyme-linked immunosorbent 
assay (ELISA). Again, approximately two-thirds of the molecules re- 
gained the conformation necessary to be recognized by the antibody 
(Schmid and Linsenmayer, 1985a). When the assay was used to exam- 
ine possible renaturation of the isolated collagenous domain of type X 
(45K chains), none was detected. 


IX. VERTEBRATE COLLAGENASE SUSCEPTIBILITY 
or TYPE X COLLAGEN 


The vertebrate collagenases are a class of enzymes that produce a 
scission through the triple helix, thereby reducing the thermal sta- 
bility of the resulting products well below that of physiological tem- 
perature (Gross, 1981). It was originally hypothesized by Gross and co- 
workers that the thermal stability of collagenase reaction products 
was of paramount importance, since this would render the collagen 
chains susceptible to cleavage by other proteolytic enzymes, thus com- 
pleting the degradation process. As has been discussed elsewhere in 
this volume, different collagen types exhibit different susceptibilities 
to the vertebrate collagenases. For example, within the extracellular 
matrix of cartilage, type II collagen is cleaved by the enzyme, while 
the minor collagens including type IX (Liotta et al., 1982) and type XI 
(Eyre et al., 1984) are not. Type X collagen is susceptible to cleavage by 
the enzyme (Schmid and Conrad, 1982a; Gibson ef al., 1983). It dis- 
plays the unique property of being cleaved at two sites within its triple 
helix rather than at a single locus as is typical of other susceptible 
collagens (Schmid et al., 1986a). 

Identification of the cleavage products, and some idea as to their 
location within the type X molecule, was achieved by analyzing digests 
of radiolabeled type X with purified human skin collagenase. For com- 
parison, digests of radiolabeled type II collagen were also analyzed. 
One type X substrate was preferentially radiolabeled in its helical 
domain by [?H]proline incorporation (90% of the proline residues are 
found in this region). A second type X preparation was preferentially 
radiolabeled by [*H]tyrosine incorporation in its nonhelical domain 
(80% of tyrosine residues occur there). Digestions of both substrates 
were performed with various concentrations of enzyme at 25°C. Since 
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this temperature is approximately 22°C below the Tn of the molecule, 
it should insure the integrity of the triple helices of the substrate and 
reaction products. SDS-PAGE analysis of the reaction products is 
shown in Fig. 4. As can be seen in lane 10, a complete digestion of the 
59K substrate chains generated three products with M,s of 32,000, 
18,000, and 9000. The sum of these three products accounts for the 
entire molecular mass of the substrate chains. 

A comparison of the collagenase cleavage products derived from 
[?H]proline-labeled (lane 10) and [?Hltyrosine-labeled (lane 15) type X 
substrates shows that the 18K fragment must contain the tyrosine- 
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Fic. 4. SDS-PAGE of the human skin collagenase digestion of type X and type II 
collagens. Each lane contained 10,000 cpm of collagen substrate. Lanes 1-5 contained 
(3H]proline-labeled type II collagen substrate. Lanes 6-10 contained [#H |proline-labeled 
type X collagen substrate. Lanes 11-15 contained [3Hltyrosine-labeled type X collagen 
substrate. The human skin collagenase was activated with trypsin and all the assays 
were performed at 25°C for 22 hr. Lanes 1, 6, and 11 contained no enzyme. Lanes 2, 7, 
and 12 contained 0.5 ul of enzyme. Lanes 3, 8, and 13 contained 1 ul of enzyme. Lanes 4, 
9, and 14 contained 2 ul of enzyme. Lanes 5, 10, and 15 contained 4 wl of enzyme. The 
vertebrate collagenase products of type II collagen are marked on the left side of the gel, 
and the relative molecular mass of the collagenase products of type X are shown on the 
right side of the gel. Reproduced from Schmid et al. (1986a), J. Biol. Chem. 261, 4184- 
4189, by permission of the copyright owner The American Society of Biological Chem- 
ists, Inc. 
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rich globular domain at the carboxy-terminus of the molecule. The 
32K and 9K fragments are enriched in [?H]proline and are probably 
derived from the helical domain of type X. The position of the 32K and 
9K fragments in the type X molecule can be determined from an anal- 
ysis of the single cleavage intermediates shown in lane 7. The largest 
intermediate has M, 50,000 chains and is probably generated by the 
loss of the 9K fragment from the 59K substrate. The 50K intermediate 
also contains the tyrosine-rich globular region (see lane 15). The 9K 
fragment must, therefore, be derived from the amino-terminus of the 
helical domain, since this is the only possibility for generation of a 
50K intermediate. 

The location of the vertebrate collagenase cleavage sites in type X 
collagen was also established by electron microscopic visualization of 
rotary shadowed preparations (Schmid et al., 1986a). To serve as a 
point of reference for the sites, a monoclonal antibody (X-AC9) was 
included with the cleaved preparations. The monoclonal antibody is 
known to bind within the triple-helical domain, at a locus approx- 
imately 20 nm from the globular domain. Figure 5 shows a prepara- 
tion of the major collagenase cleavage product (32K chains) of type X 
collagen to which the monoclonal antibody is bound. For comparison, 
intact type X molecules with bound antibodies (from a separate prepa- 
ration shown in black rectangles) have been superimposed such that 
the monoclonal antibodies on the cleaved and native preparations are 
aligned. It can be seen that two cleavages are made by the enzyme. 
One cleavage removes the carboxy-terminal globular region plus a 
short portion of the adjacent helix; the second removes a short portion 
of the helix at the amino-terminus. 

Similar products were observed after type X collagen was digested 
with a variety of sources of vertebrate collagenase, including ones 
from human skin, synovium, neutrophil, and rabbit cornea. The exact 
bonds cleaved by the enzyme are unknown. Assuming the cleavages 
occur between Gly-Leu or Gly-Ile residues, the standard sites of cleav- 
age for the enzyme in other collagen types (Woolley, 1984), there are 
several potential sites at the predicted locations within the type X 
molecule. As deduced from the genomic DNA sequence, two of the 
most likely candidates are a Gly-Leu bond 92 residues from the amino- 
terminus ofthe helical domain and a Gly-Ile bond 40 residues from the 
carboxy-terminus of the triple helix. Each of these bonds is close to än 
interrupted site along the triple helix in which a Gly-X-Y triplet is 
separated by two additional amino acids from the next Gly-X-Y triplet. 
These may be potential sites of instability within the molecule. Each 
interruption is followed by a sequence deficient in imino acid residues, 
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Fıc. 5. Electron micrograph of rotary-shadowed preparations of a vertebrate col- 
lagenase digestion of type X collagen and the X-AC9 monoclonal antibody. After the 
digestion of type X collagen with human skin collagenase, the X-AC9 antibody was 
added and the mixture was rotary shadowed. The long arrows point to the molecules that 
represent the major vertebrate collagenase cleavage products (32K) of type X. These 
molecules retain the epitope recognized by the X-AC9 antibody at one end of the mole- 
cule. For comparison, native type X molecules with a bound antibody have been super- 
imposed on the electron micrograph (outlined in the black rectangles) and aligned with 
respect to the bound antibody. The arrowheads mark X-AC9 molecules which have cross- 
linked two 32K collagenase cleavage products of type X collagen. Bar, 100 nm. Re- 
produced from Schmid et al. (1986a), J. Biol. Chem. 261, 4184-4189, by permission of 
the copyright owner The American Society of Biological Chemists, Inc. 


which might further compromise the helical stability of these regions 
(see sequence of type X molecule in Vasios et al., this volume, Fig. 5). 

Type X collagen appears to be degraded more rapidly than type II 
collagen using in vitro solution assays. Preparations oftype X and type 
II collagens were isolated from the same chondrocyte cultures after 
labeling with [?H]proline. A comparison of similar digestions by 
human skin collagenase, with equal amounts ofradioactivetype X and 
type II collagen substrates, showed that the type X substrate (Fig. 4, 
lane 9) is more rapidly degraded than the type II substrate (Fig. 4, lane 
4). 

Since vertebrate collagenase is thought to facilitate the degradation 
of collagenous molecules by generating products with reduced thermal 
stability, the thermal stability of the major collagenase product (82K 
chains) of type X collagen was examined. The thermal denaturation 
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curve for the molecule by CD spectropolarimetry gave a Tm of 43°C in 
neutral solutions (Schmid et al., 1986a). Thus, the denaturation tem- 
perature for the fragment with 32K chains is only slightly reduced 
from the Tm of the native molecule (47°C) and should not undergo 
spontaneous denaturation at the physiological temperature of the 
chick. Instead, the triple-helical fragment with 32K chains may be 
released en bloc from the center of the type X molecule. Whether it 
persists in vivo and has any physiological function is at present 
unknown. 


X. MOLECULAR STRUCTURE OF TYPE X COLLAGEN 


The most complete information on the structure of type X collagen 
has been derived from the nucleotide sequence of the type X collagen 
gene (see Vasios et al., this volume, Fig. 5; Ninomiya et al., 1986). A 
cDNA library was constructed from poly(A)* RNA isolated from long- 
term cultures of chick embryonic tibiotarsus chondrocytes. One cDNA 
clone (600 nucleotides) hybridized to mRNA (2500 nucleotides) from 
long-term chondrocyte cultures and mRNA isolated from the cephalic 
region of chick embryo sterna. The cDNA clone did not hybridize to 
mRNA from a 17-day chick embryo sterna (which would synthesize 
very low levels of type X collagen). The cDNA clone was used to screen 
a chicken EcoRI genomic library. A genomic clone was isolated which 
contained the cDNA sequence. The amino acid sequence predicted 
from the genomic DNA sequence was identical to the amino acid se- 
quence of two CNBr peptides of type X collagen. The coding sequence 
of the type X collagen gene was not interrupted by introns. 

The largest form of the molecule isolated from the medium of 
chondrocyte cultures is similar in size to the molecule which can be 
inferred from the DNA sequence of the type X collagen gene. However, 
the exact size of the molecule, as transported into the extracellular 
milieu, is not known. The molecule contains two noncollagenous do- 
mains, NC1 and NC2. NC1, the largest noncollagenous domain, is 
located at the carboxy-terminus and contains 170 amino acid residues. 
The NC2 domain is a short telopeptide-like region located at the ami- 
no-terminus of the molecule. 

The collagenous domain contains 460 amino acid residues and, in 
general, exhibits the repetitive Gly-X-Y sequence characteristic of a 
collagen molecule. There are, however, two general types of deviation 
from the Gly-X-Y sequence. Four positions in the molecule have a Gly- 
X-Gly sequence; three others have a Gly-X-Y which is separated from 
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the next triplet by two additional amino acids. As mentioned above, 
two of these positions may be sites for cleavage by vertebrate col- 
lagenase. These deviations from the usual Gly-X-Y sequence do not 
produce sufficient defects in the helical structure to make it suscepti- 
ble to cleavage by pepsin (Schmid and Conrad, 1982a; Gibson et al., 
1982), chymotrypsin (Gibson and Flint, 1985), or trypsin (T. M. 
Schmid, unpublished results). 

The sequence of the type X molecule (Vasios et al., this volume, Fig. 
5) also reveals the presence of a hydrophobic sequence in the non- 
helical domain (NC1) towards the carboxyl end of the molecule (Nino- 
miya et al., 1986). A hydrophobic sequence of 29 amino acids is bound- 
ed by two lysine residues and contains two cysteinyl residues. The 
hydrophobic nature and length of this sequence are characteristic of 
polypeptides that span a lipid bilayer and suggest the molecule may 
associate with a membrane. 

The intact type X collagen molecule purified from cell culture has 
been rotary shadowed and visualized in the electron microscope. As 
stated earlier, both the collagenous domain and the major globular 
domain (NC1) can be clearly visualized, the molecule having an over- 
all length of 138 + 5 nm (Schmid et al., 1984, 1986a; Kielty et al., 
1985). Frequently, the molecules associate at their globular domains to 
form aggregates (Fig. 1B). It is not clear whether such aggregation 
occurs in vivo and has physiological significance or whether it is sim- 
ply a preparational artifact. Pepsin digestion removes the globular 
domain from the carboxy-terminus of the molecule; the remaining 
collagenous domain (133 + 8 nm) accounts for most of the length. 

Type X collagen will form segment long spacing (SLS) crystallites 
when dialyzed against ATP (Schmid et al., 1984; Kielty et al., 1984), 
but this only occurs in molecules from which the noncollagenous do- 
main has been removed. The crystallites often occur as end-to-end 
dimers and, when positively stained, exhibit a banding pattern differ- 
ent from that of other collagen types (i.e., types I and II). The length 
determined for the crystallites (133 nm) is equivalent to that of the 
individual pepsinized molecules viewed in rotary shadowed prepara- 
tions. Preparations of type X molecules with intact noncollagenous 
domains formed masses of thin, fibrous material which lacked discern- 
able periodicity (T. M. Schmid, R. R. Bruns, and T. F. Linsenmayer, 
unpublished results). 

At present, we do not know whether type X collagen becomes as- 
sembled into a supramolecular structure in vivo. Studies are presently 
in progress to localize type X in vivo employing immunoelectron mi- 
croscopy with monoclonal antibody X-AC9. 
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XI. BIOSYNTHESIS OF TYPE X COLLAGEN 


Little information is available on the intracellular steps involved in 
the biosynthesis of type X collagen. Presumably, the molecule is syn- 
thesized on the rough endoplasmic reticulum, but specific information 
about the organization of the ribosomes or the sites of chain associa- 
tion and triple helix formation are unknown. Cell-free translation 
studies indicate that the newly synthesized chains of type X migrate 
close to the 59K chains of the fully hydroxylated and glycosylated 
molecules secreted by cells (Cancedda et al., 1985; Ninomiya et al., 
1986; Balian and Morris, 1986). Castagnola et al. (1986) have recently 
shown that type X contains a signal sequence of M, 1500-2000 which 
is removed by dog pancreas membranes in cell-free translation experi- 
ments. The NC2 domain at the amino-terminus of type X collagen 
contains a group of hydrophobic amino acids resembling a typical sig- 
nal sequence (see sequence in Vasios et al., this volume, Fig. 5). There- 
fore, approximately half of the NC2 domain is probably removed be- 
fore type X collagen is secreted. Type X collagen requires ascorbate for 
the proper hydroxylation of the molecule and, in the absence of ascor- 
bate, at least some under-hydroxylated molecules are secreted at 37°C 
(Capasso et al., 1982). 

Type X collagen was detected in several different sizes of intra- 
cellular vesicles by immunofluorescent histochemistry with mono- 
clonal antibody X-AC9 (Linsenmayer et al., 1986b). In certain car- 
tilages undergoing hypertrophy, the intracellular type X-containing 
vesicles are readily detected within chondrocytes just preceding the 
boundary at which the type X begins to be deposited as extracellular 
matrix. Figure 6 shows an example of such antibody-reactive intra- 
cellular vesicles within chondrocytes in a developing vertebral body. 
The detection of intracellular type X within cells which havenotype X 
in their surrounding matrix suggests the molecule may be stored in- 
tracellularly for a period of time before it is secreted. 

The thermal stability of the intracellular type X molecules was ex- 


Fic. 6. Fluorescence micrographs of sections reacted at different temperatures with 
anti-type X collagen monoclonal antibody. Sections were equilibrated (30 min), stained 
(30 min), and washed at various temperatures (A, 45°C; B, 50°C; C, 60°C. D was incu- 
bated at 60°C, (30 min) and cooled to 40°C before reacting with antibody at the lower 
temperature. In C, the white arrows point to cells which are nonreactive. In D, the 
arrowheads point to two cells in which the intracellular reactivity has returned upon 
cooling to the lower temperature. All are x420. Reproduced from Linsenmayer et al. 
(1986b), Exp. Cell Res., in press, with copyright permission of Academic Press, Orlando, 
Florida. 
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amined with monoclonal antibody X-AC9. When unfixed cryostat 
tissue sections were incubated with the antibody at various tem- 
peratures, the intracellular vesicles still retained their antibody reac- 
tivity at 55°C, but lost it by 60°C. This is considerably higher than one 
would deduce from in vitro measurements. The reactivity within the 
surrounding extracellular matrix was still retained at 67.5°C. This is 
the highest temperature we could examine, since the antibody itself is 
thermally denatured above this temperature. It is tempting to specu- 
late that the increased thermal stability of the type X collagen, when 
deposited in the extracellular matrix, is the result of its assembly into 
a supramolecular form following its movement from intracellular to 
extracellular compartments. Extracellular events might include pro- 
teolytic processing, fibril formation, and cross-link formation. 

Type X collagen is synthesized in a form that resembles a pro- 
collagen molecule, i.e., it contains both a collagenous triple-helical 
domain and a sizable noncollagenous domain at one end of the mole- 
cule. However, any experiments suggesting that the 59K chains of 
type X are processed to some other form must be evaluated with cau- 
tion, since the globular domain is sensitive to general proteolytic ac- 
tivity (Schmid and Conrad, 1982a; Gibson et al., 1982, Gibson and 
Flint, 1985). A recent report presented data suggesting the cleavage of 
the 59K chains of type X to 50K chains (Kielty et al., 1985). The loss 
occurred from the globular domain of the molecule. These experiments 
were performed using organ cultures of hypertrophic cartilage from 
the 17-day chick tibiotarsus. The cartilage pieces were pulsed with 
[2H]proline for 1 hr and chased with unlabeled proline. After 6 hr, the 
majority of the 59K chains had been converted to 50,000 chains. An- 
other pulse-chase organ culture study using rabbit wrist and cos- 
tochondral growth cartilage reported that type X was lost after a 20 hr 
chase (Sussman et al., 1984). A third study reported a rapid degrada- 
tion of type X collagen in cell culture (Capasso et al., 1984a). 

Other pulse—chase experiments have produced different results. 
Pulse—chase experiments using organ cultures of cartilage from the 
fetal bovine costochondral junction showed no detectable processing of 
type X, even after the chase was extended to 20 hr (Grant et al., 1985). 
In other experiments, type X was not processed in the medium of 
chondrocyte monolayer cultures during a time period in which type II 
procollagen was almost completely converted to molecules composed of 
al(ID chains (Schmid and Conrad, 1982b). It is possible that the differ- 
ent conditions of cell and organ culture employed may have contrib- 
uted to at least some of the variability in these results. 

The majority of type X molecules isolated from cartilage in vivo are 
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not processed (T. M. Schmid, unpublished observations). Chick em- 
bryos were made lathyritic with an injection of ß-aminoproprionitrile 
(ß-APN) into their airsacs on day 15 of development. After 3 days, the 
embryos were sacrificed and the type X extracted from their long 
bones with 4 M guanidinium chloride in the presence of protease in- 
hibitors. The type X was isolated with an affinity gel made of mono- 
clonal antibody X-AC9 coupled to Sepharose. The type X molecules 
isolated by the affinity gel migrated on SDS-PAGE at a position indis- 
tinguishable from the 59K chains of molecules isolated from chondro- 
cyte cell cultures. The types I and II collagens extracted from these 
same tissues have been completely processed to al and «2 chains. 
Thus, little if any type X is processed beyond the stage of 59K chains 
in vivo. 

The presence of the globular domain on the form of the mature type 
X molecule which is deposited in the extracellular matrix may greatly 
affect the ability of the molecule to form macromolecular aggregates. 
For example, type VII collagen appears to be present in tissue as SLS- 
like aggregates (Bentz et al., 1983). However, the globular domain of 
type X appears to inhibit SLS crystallite formation, at least under in 
vitro conditions. Rotary shadowed preparations of type X show ag- 
gregation of the molecules by their globular domains, but as stated 
earlier, it is not known if this is physiologically important. 

In chicken cartilage, type X collagen within the extracellular matrix 
appears to be covalently cross-linked. The extractability of type X 
collagen from cartilage in vitro (Reginato et al., 1986) and in vivo (T. 
M. Schmid, unpublished observations) was increased if the cartilage 
was incubated with the lathyrogen ß-APN. Experiments employing 
reducing agents suggest the cross-links do not involve disulfide bonds. 
Direct evidence for the involvement of specific amino acid residues has 
not been determined. SDS gels of purified type X collagen prepara- 
tions isolated from chondrocytes cultured in the presence of B-APN 
sometimes show a collagenous band of M, 120,000 (Schmid and Lin- 
senmayer, 1983). This protein most likely represents a dimeric form (B 
chain) of type X collagen chains, since it is precipitated from type X 
preparations with the anti-type X monoclonal antibody. The ß chains 
seem to be covalently cross-linked since their size does not change 
after boiling the molecules in the presence of a reducing agent and 
detergent. The ß chains also appear to be susceptible to proteolytic 
cleavage with pepsin, generating the 45,000 chains typical of the type 
X helical domain. Thus type X appears to contain covalent cross-links 
in a pepsin-sensitive region of the molecule. 

A quantitative comparison of the extractability of type X collagen 
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from radiolabeled 17-day chick sterna showed that the molecule was 
not easily extracted unless a lathyrogen was present. In the presence 
of B-APN, most of the radiolabeled type X collagen was extracted in 
either al M NaCl, 50 mM Tris, pH 7.4 buffer or 0.5 M acetic acid 
(Reginato et al., 1986). In the absence of B-APN, only about 20% of the 
type X was extracted. The remainder of the type X collagen could be 
recovered only after pepsin treatment followed by extraction in 0.15 M 
NaCl, 50 mM Tris, 20 mM dithiothreitol, pH 7.4 (Jimenez et al., 1986). 
The presence of the reducing agent was necessary to extract the type X 
collagen. Since the molecule recovered from this procedure contained 
45K chains which lacked cysteine residues, it was concluded that other 
cysteine-rich molecules must hold the type X collagen in the tissue. 


XII. IMMUNOHISTOCHEMICAL LOCALIZATION OF TYPE X 
COLLAGEN WITH A MONOCLONAL ANTIBODY 


Several monoclonal antibodies have been produced which recognize 
type X collagen, and one antibody, X-AC9, has been extensively char- 
acterized. The X-AC9 antibody has been useful as an immu- 
nohistochemical reagent to determine the location of type X in embry- 
onic and adult tissue (Schmid and Linsenmayer, 1985a,b; Linsen- 
mayer et al., 1986a), as a molecular marker in ultrastructural studies 
(Schmid et al., 1986a), as a probe for the thermal stability of the type X 
triple helix (Linsenmayer et al., 1986b), and as a preparative reagent 
for the quantitative isolation of type X from complex mixtures of 
proteins (Solursh eż al., 1986). 

To produce this antibody, female SJL/J mice were hyperimmunized 
with native type X collagen (59K chains) and their splenocytes fused 
with NS-1 myeloma cells. The resulting hybridomas were selected in 
hypoxanthine, aminopterin and thymidine-containing medium, and 
those producing the desired antibody were identified by ELISA using 
plates that were coated with intact type X (59K chains) or the triple- 
helical domain (45K chains) of type X. 

The antibody produced by the X-AC9 hybridoma gave the strongest 
reaction in ELISA and, therefore, the cells were cloned and the anti- 
body extensively characterized. The X-AC9 antibody is an IgG, mole- 
cule which binds in the triple-helical domain of the type X molecule. In 
competition assays, it did not react with the chicken collagen types I, 
II, II, IV, V, IX, or XI (1a2a3a). The antibody also did not bind to the 
thermally denatured chains of type X collagen. Examination of rotary 
shadowed preparations showed that the antibody binds to a single 
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epitope on the type X molecule, located approximately 20 nm from the 
carboxy-terminal end of the triple-helical domain (Fig. 5). 

The antibody was used for indirect immunofluorescent staining of 
unfixed frozen tissue sections. The embryonic chick tibiotarsus was 
examined first, since the earlier biochemical analyses had shown that 
the molecule is restricted to a defined region within this structure 
(Schmid and Conrad, 1982b). In the tibiotarsus, the earliest detectable 
reactivity for type X collagen was found at 7.5 days of development 
(stage 32 of Hamburger and Hamilton, 1951) in the cartilage matrix of 
the middiaphysis (Schmid and Linsenmayer, 1985b). This distribution 
is restricted compared to that of type II collagen, which is found 
throughout the entire cartilaginous anlage. At this stage, type I col- 
lagen is present in the sheath of periosteal bone that surrounds the 
middiaphysis and defines the primary center of ossification. At 9.5 
days of development (stage 35.5), the early formation of the marrow 
cavity can be observed as a middiaphyseal focus of cartilage erosion. 
At this time, the reactivity for type X collagen extends considerably 
outwards from the center of the diaphysis, occupying the middle 40% 
of its length. 

In Fig. 7, the location of type X collagen is compared with the dis- 
tribution of types I and II collagen in serial sections of a 13-day 
tibiotarsus (Schmid and Linsenmayer, 1985a). Figure 7C shows that 
type II collagen is distributed throughout the entire cartilaginous 
anlage. The marrow cavity (mc) now fills approximately the middle 
half of the structure, and the sheath of periosteal bone, which stains 
brightly for type I collagen (Fig. 7A), has thickened and progressed 
toward each end of the tibiotarsus. The type X (Fig. 7B) is localized in 
the cartilage matrix near the marrow cavity. 

The regions showing reactivity with the anti-X antibody at 7.5, 9.5, 
and 13 days of development are all composed of hypertrophic cartilage. 
Other investigators have shown that the hypertrophic chondrocytes 
within the tibiotarsus are first observed in the tibiotarsus, at approx- 
imately 6.5 days of development, in the center of the shaft (Scott- 
Savage and Hall, 1979). These large chondrocytes generally corre- 
spond to, and are demarcated by, the overlying sleeve of periosteal 
bone (identified in Fig. 7 with arrowheads). Thus, the deposition of 
type X collagen lags slightly behind the hypertrophy of the chondro- 
cytes, a result which is in complete agreement with our biochemical 
analyses. 

The distribution of type X collagen at 19 and 21 days of incubation is 
also consistent with its synthesis solely by the hypertrophic region of 
chondrocytes. By 19 days of development, most of the cartilage re- 
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Fic. 7. Fluorescence micrographs of distal half of 13-day chick tibiotarsus treated 
with monoclonal antibodies directed against collagen types I (A), X (B), and II (C). 
Arrowheads mark the distal boundary of hypertrophic chondrocytes. The marrow cavity 
(mc) and epiphysis (e) are labeled. Bar, 1 mm. Reproduced from The Journal of Cell 
Biology, 1985, 100, 598-605 by copyright permission of The Rockefeller University 
Press. 
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maining in the diaphyseal region now shows reactivity for type X. 
However, the region adjacent to the epiphysis is still negative. Most of 
the epiphysis itself is still negative, except for a small perivascular 
region of hypertrophic cells demarcating the region in which a second- 
ary center of ossification characteristically forms. At a higher magni- 
fication (shown in Fig. 8), this region shows the most intense reactivity 
for type X collagen in the matrix directly surrounding the blood vessel 
channels, with a zone of progressively decreasing matrix reactivity 
extending outward in a radial fashion. Frequently, however, isolated 
foci of strong cellular/pericellular reactivity (circular profiles) exist 
some distance from the region of strong matrix staining. The matrix 
surrounding numerous other epiphyseal blood vessels is negative for 
type X, suggesting that blood-borne stimuli are not solely responsible 
for the initiation oftype X synthesis, as has been previously suggested 
(Capasso et al., 1984b). 

In the 1-day-old chick, the relationships of collagen type II (Fig. 9A) 
and type X (Fig. 9B) to the center of hypertrophy within the epiphysis 


Fic. 8. Fluorescence micrograph of center of anti-type X reactivity in the epiphysis of 
a 19-day chick tibiotarsus. Type X reactivity surrounds two blood vessels at the future 
site of the secondary center of ossification. Bar, 200 um. Reproduced from Schmid and 
Linsenmayer (1985b), Dev. Biol. 107, 373-381, with copyright permission of Academic 
Press, Orlando, Florida. 
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Fic. 9. Fluorescence micrograph of the epiphysis (ep) of a 21-day (1 day posthatch) 
chick tibiotarsus treated with anti-type II (A) and anti-type X (B) antibodies. Type X 
reactivity is present throughout most of the tibia and also in the epiphysis at the 
presumptive site of the secondary center of ossification. The double-pointed arrows 
mark a zone (0.4 mm) of growth cartilage which does not react with the anti-type X 
antibody. This region separates the tibia shaft from the epiphysis. Reproduced from 
Schmid and Linsenmayer (1985b), Dev. Biol. 107, 373-381, with copyright permission of 
Academic Press, Orlando, Florida. 


have now become more definitive. Extensive reactivity for type II col- 
lagen is found throughout the epiphysis (ep) and the adjacent portions 
of the diaphysis. Within the epiphysis there is a loss of reactivity in 
the midepiphyseal zone of hypertrophic cells. In a serial section reac- 
tion with the anti-X antibody (Fig. 9B), this zone is strongly fluores- 
cent and has considerably increased in diameter from what it was 2 
days earlier. : 

In the sections reacted for type II collagen, there is a 0.4 mm band 
(demarcated by the double-headed arrow in Fig. 9A) between the epi- 
physis and the diaphysis. This band consists of a weakly reactive zone 
(nearest the epiphysis) and a strongly reactive zone (nearest the di- 
aphysis). In the section reacted for type X collagen (Fig. 9B), the corre- 
sponding region (again demarcated with arrowheads) is completely 
negative. Strong reactivity for type X begins abruptly on the di- 
aphyseal side of this band, where the hypertrophic cells are now clear- 
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ly evident. By contrasting the fluorescent patterns obtained with the 
two antibodies, it can be concluded that this band plus the adjacent 
hypertrophic region of the diaphysis comprise the characteristic re- 
gions of an epiphyseal growth zone (see Bloom and Fawcett, 1975). 

Preliminary experiments examining the distribution of the mole- 
cule in tibiotarsi from more mature chickens (7-9 weeks) show that 
type X remains in the region of calcified cartilage which separates the 
articular hyaline cartilage from the subchondral bone. In the chicken, 
the most superficial layer ofthe epiphysis is a fibrous layer containing 
type I collagen (von der Mark et al., 1976). A region of hyaline car- 
tilage containing a type II collagen underlies the fibrous layer. Type X 
is present a few cell layers above the tidemark, the boundary between 
the uncalcified and calcified cartilage. Type X is also present in the 
remnant cartilage at the core of spicules of endochondral bone (Schmid 
and Linsenmayer, 1985a). 

The brightest staining intensity of the X-AC9 antibody was always 
in regions of hypertrophic cartilage, while regions of bone also showed 
very light but positive reactivity. The sheath of periosteal bone of the 
developing tibiotarsus in Fig. 7 also reacted lightly, as did calvarial 
bone. Appropriate controls suggested that this light reactivity of bone 
was specific; however, further biochemical experiments were unable to 
detect the presence of type X collagen in chick calvaria. At this time, 
we assume the antibody is cross reacting with some other substance in 
bone or that type X, if present, is there in exceedingly small amounts. 

Type X collagen is also present in the notochord (Linsenmayer et al., 
1986a). The notochord is a cartilage-like skeletal tissue which has 
been shown to have an inductive influence on the formation of the 
cartilaginous vertebral bodies (Holtzer, 1961; Lash, 1968). Temporally, 
the appearance of type II collagen in the notochord precedes the detect- 
able appearance of type II collagen in the surrounding vertebral tissue 
(Linsenmayer et al., 1973). Conversely, however, the temporal ap- 
pearance of type X collagen in the notochord is subsequent to its ap- 
pearance in the surrounding vertebral bodies. It is detectable first in 
the hypertrophic cartilage of the vertebral bodies, and several days 
later it is detectable in the adjacent notochordal tissue. Also, the spa- 
tial distribution of type X within the notochord differs from that of 
type II. The embryonic notochord and adjacent vertebral bodies show a 
segmental distribution in which type X (Fig. 10A) is preferentially 
deposited in the midvertebral regions. Conversely, the type II collagen 
(Fig. 10B) accumulates in the intervertebral regions. This inverse dis- 
tribution of type X and II within the notochord seems to correlate with 
the different developmental fates of these regions. The type X-rich 
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Fic. 10. Fluorescence micrograph of a longitudinal section through three sequential 
vertebrae of a 19-day embryo. The vertebrae are designated V1, Vz, V3, and the interver- 
tebral regions are designated IV, and IVs. (A) Section has been reacted with an anti- 
type X antibody; (B) section has been reacted with an anti-type II antibody. In both A 
and B, the notochord is indicated by arrowheads. In A, the notochord is sectioned such 
that it runs uninterrupted through all the vertebral bodies and intervertebral regions; in 
B, it is uninterrupted except at site IV. where, due to the plane of section, it appears to 
be constricted. Bar, 200 um. Reproduced from Linsenmayer et al. (1986a), Dev. Biol. 113, 
467-473, with copyright permission of Academic Press, Orlando, Florida. 


region will become part of the bone and marrow of the centrum of the 
vertebrae (Williams, 1908); the type IJ-rich regions will give rise to the 
nucleus pulposus of the intervertebral discs (Wolfe et al., 1965). 
Type X was not detectable by immunohistochemical methods in em- 
bryonic tissue outside the skeletal system, nor was it detectable in 
permanent cartilaginous tissues, such as those of the trachea and scle- 
ra. Several noncartilaginous tissues which synthesize type II collagen 
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tested negative [i.e., the chick primary corneal stroma (Linsenmayer et 
al., 1977) and the vitreous body (Smith et al., 1976)]. However, it is 
always possible that a lack of immunohistochemical reactivity is the 
result of masking of a particular epitope as occurs for type V collagen 
(Linsenmayer et al., 1983). 


XIII. BIOCHEMICAL LOCALIZATION OF TYPE X COLLAGEN 


Several laboratories have analyzed the distribution of type X col- 
lagen within skeletal tissue by biochemical methods. Capasso et al. 
(1984b) examined the distribution of type X collagen during several 
stages of embryonic chick tibiotarsus development. The first detectable 
synthesis oftype X was in the diaphysis at 9 days of development and in 
the epiphysis by day 17. Immunohistochemical analysis seems to be 
more sensitive, sincean anti-X antibody was able to detect the synthesis 
of type X in the diaphysis at 7.5 days of development (Schmid and 
Linsenmayer, 1985b). 

Grant et al. (1985) analyzed collagen synthesis by bovine chondro- 
cytes derived from the growth plate at the costochondral junction of a 
bovine fetus. Type X collagen was synthesized only in the zone of 
cartilage degeneration and calcification, which included the first 400 
um of tissue between the bone and adjacent growth cartilage. Type X 
was not synthesized by the zones of maturation or proliferation, the 
reserve zone, or other structural cartilages. 

Several laboratories (Gibson et al., 1984; Gibson and Flint, 1985; 
Reginato et al., 1986; Jimenez et al., 1986) have analyzed the distribu- 
tion of type X in the chick sternum by biochemical methods. Whereas 
the epiphyseal growth plate of long bones exhibits a linear continuum 
of chondrocyte maturation, the sternum is a more complex structure 
which undergoes endochondral bone formation in three dimensions. 
The cephalic half of the sternum begins to ossify just prior to the 
hatching of the chick. The caudal aspect of the sternum remains car- 
tilaginous for a long period of time (>8 weeks), but eventually it also 
begins to ossify. Type X is synthesized by embryonic chondrocytes 
from the cephalic half of the tissue but is not synthesized by embryonic 
chondrocytes from the caudal half of the sternum. Before 16 days of 
development, sternal type X collagen is not detectable by biochemical 
methods, but between day 16 and 17 the synthesis of type X increases 
suddenly (Gibson and Flint, 1985). This is concomitant with the hyper- 
trophy of the cells in the region and consistent with the immunohisto- 
chemical observations of the tissue (Schmid et al., 1986b). 

The hypertrophic cells in the cephalic aspect of the sternum were 
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recognized as a translucent zone (Gibson and Flint, 1985; Reginato et 
al., 1986). Gibson and Flint (1985) isolated these regions and radiola- 
beled them in organ culture. Their experiments showed type X to rep- 
resent 2% of the newly synthesized collagen on day 16 of development, 
60% of the collagen by day 17, and approximately 80-90% of the 
collagen after 18 days of development. On day 19, the translucent 
region of cartilage begins to calcify (Gibson and Flint, 1985; Reginato 
et al., 1986). Thus, although type X represents a quantitatively minor 
collagen when an entire cartilaginous tissue is examined, it appears to 
be the major collagenous product of chondrocytes during their termi- 
nal stage of maturation. The accumulation of type X collagen precedes 
the calcification of the tissue. 


XIV. Functions OF TYPE X COLLAGEN 


Cartilaginous tissues contain at least three collagen types: type II, 
type XI (1a2a3a), and type IX collagen. Type X collagen is only present 
in a cartilage matrix containing hypertrophic chondrocytes and be- 
comes the major type synthesized by the most mature chondrocytes 
involved in endochondral bone formation. The molecule, therefore, 
bridges the spatial and temporal transition from cartilage to bone. 
This is true for the process of endochondral ossification during skeletal 
growth, for the maintenance of the articular cartilage—bone interface 
of adult bones, and for the endochondral repair of fractured bones 
(Grant et al., 1986). During long bone development and fracture re- 
pair, most of the cartilaginous tissue forms a temporary model which 
will eventually be removed and replaced by bone. Therefore, the ma- 
jority of type X synthesized by an organism will be removed within a 
relatively short period of time. 

One possible function of type X collagen may be to facilitate the 
removal of the hypertrophic cartilage. We would speculate this could 
be accomplished if type X collagen were more susceptible to degrada- 
tion than type IJ. Our preliminary experiments suggest that type X 
collagen is indeed more rapidly degraded by vertebrate collagenase 
than is type II collagen (Schmid et al., 1986a), and recent results by 
Dean et al. (1985) suggest that vertebrate collagenase activity is pre- 
sent within hypertrophic cartilage. The degradation of type X collagen 
may “open up” the hypertrophic cartilage matrix to other degradative 
enzymes and to the invasion of other cells. 

The structural characteristics of an extracellular matrix composed 
primarily of type X collagen may be very different from those of a 
cartilage matrix composed of type II collagen. The matrix surrounding 
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the most hypertrophic chondrocytes may consist primarily of type X 
collagen. The different length ofthe collagen molecule or the size of its 
molecular aggregate may significantly affect the interaction with 
other macromolecular constituents and the hydrodynamic properties 
of the matrix (Hukins and Aspden, 1985). 

The second possible function of type X may be in the formation of an 
extracellular matrix which is amenable to calcification. In some car- 
tilaginous tissues, the deposition of type X collagen either slightly 
precedes or is concomitant with the calcification of hypertrophic car- 
tilage. During ossification of the cephalic portion of the embryonic 
chick sternum, the region of hypertrophic cartilage becomes calcified 
soon after the appearance of type X. Within the organization of adult 
articular cartilage, the superficial hyaline layer is separated from the 
subchondral bone by a layer of calcified cartilage. Type X is present in 
this region and a few cell diameters above the tidemark, the boundary 
between the calcified and uncalcified cartilage. 

Matrix vesicles seem to be involved in the initiation of cartilage 
calcification (Anderson, 1967; Bonucci, 1967), and type X collagen is 
known to be associated with such vesicles (Habuchi et al., 1985). The 
vesicles are released from the plasma membrane of chondrocytes. 
They have a unique phospholipid composition, are rich in alkaline 
phosphatase, and contain elevated levels of calcium (Anderson, 1967; 
Bonucci, 1967; Ali, 1983). The release of matrix vesicles increases with 
the maturation of the chondrocyte (Ali, 1976; Glaser and Conrad, 
1981). Habuchi et al. (1985) showed that radiolabeled type X collagen 
was present in matrix vesicle preparations isolated from chondrocyte 
culture medium by ultracentrifugation. They also showed that the 
synthesis of collagen and alkaline phosphatase were coordinately reg- 
ulated. These results are particularly interesting, considering the 
presence of a hydrophobic region at the carboxy-terminus of type X 
which might span the lipid bilayer of a matrix vesicle (see sequence of 
type X in Vasios et al., this volume, Fig. 5). Thus, type X may be an 
integral component of matrix vesicles. Further experimentation is 
necessary to understand the relationship of type X collagen to the 
calicification process, 


XV. UNRESOLVED QUESTIONS ABOUT TYPE X COLLAGEN 


The list of unanswered questions concerning type X collagen is long. 
What are the biosynthetic steps involved in its synthesis and trans- 
port? What information directs the association of the three type X 
chains, rather than allowing the assembly of a mixed molecule con- 
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taining, for example, chains of both type IX and type X collagens? Are 
any interchain or intrachain disulfide bonds formed during the syn- 
thesis of type X? Are hydrophobic interactions between the chains 
important for chain assembly? Are such interactions a prerequisite 
for, or a result of, triple-helical formation? How are type X molecules 
transported to the extracellular matrix? 

Although the tissue distribution of type X is known from immu- 
nohistochemical and biochemical studies, we do not know if type X 
collagen forms aggregates within the tissue in vivo. Immunoelectron 
microscopic studies are needed to define this. Is type X assembled in a 
lateral or linear aggregate? Are the molecules staggered? What kind 
of cross-links occur and where are they located? 

Does the accumulation of type X in the tissue facilitate the turnover 
of the tissue? What are the quantitative relationships of type X to the 
other cartilage collagens during endochondral bone formation? Are 
the synthesis and/or removal of several different cartilage collagens 
coordinately regulated? What regulates the levels of type X collagen in 
tissues? What events initiate the synthesis of type X? Do the 
cytoskeletal changes which accompany cell hypertrophy trigger the 
synthesis of type X? Is there active vertebrate collagenase in the area 
of hypertrophic cartilage which will be removed? What other enzymes 
are necessary for the removal of the extracellular matrix in this re- 
gion? What cells are responsible for the turnover of cartilage matrix? 

Is type X involved in the calcification of cartilage? Does type X bind 
calcium? Do any other macromolecules bind to type X and are such 
interactions physiologically significant? 

What is the evolutionary relationship of type X collagen to other 
collagen types? Why does the type X gene have only one intron? Is type 
X present in organisms whose skeleton remains cartilaginous through- 
out life? 

How is human type X collagen related to chick or bovine type X 
collagen? How would abnormal type X molecules affect the develop- 
ment of the skeleton? Is type X collagen necessary for the endo- 
chondral repair of fractured bones? Might degradation products of 
type X collagen in body fluids be indicative of calcified cartilage ero- 
sion or endochondral bone repair? Would assays for these molecules be 
useful in monitoring the progress of osteoarthritis, rheumatoid arthri- 
tis, or bone fracture repair? 

The answers to these questions should provide some insight into the 
structural function of type X collagen. Such knowledge will provide a 
rational basis for future experimentation related to the diseases and 
development of the skeletal system. 
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I. INTRODUCTION 


This minor collagen was first discovered by Burgeson and Hollister 
(1979) during differential salt fractionation of pepsin digests of 
hyaline cartilage. The protein is best recognized by the separation of 
three distinct a chains on SDS-polyacrylamide electrophoresis, all of 
which migrate slightly slower than al1(II) chains. The chains were 
assigned the provisional names, la, 2a, and 3a. The collagen re- 
sembled type V collagen in the following properties: (1) both collagens 
precipitated at 1.2 M NaCl from acid solution; (2) each accounted for 
5% or less of total tissue collagen; (3) their a chains migrated in the 
same distinctive region on SDS-polyacrylamide electrophoresis; and 
(4) they shared characteristic features of amino acid composition that 
set them apart from types I, II, and III collagens. However, the la and 
2a chains gave unique cyanogen bromide (CNBr) peptide patterns on 
electrophoresis and chromatography, consistent with each being a ge- 
netically distinct product (Burgeson and Hollister, 1979; Reese and 
Mayne, 1981; Burgeson et al., 1982). The 3a chain, however, gave an 
almost identical CNBr peptide map to that of the al(ID chain, except 
that all the CNBr peptides of 3a ran slightly slower than their a1(ID) 
counterparts. From these results, it was suggested that the 3a and 
al(D chains have the same primary sequence, or at least are very 
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close genetic variants, but differ posttranslationally in their degree of 
lysine hydroxylation and glycosylation. 


Il. NOMENCLATURE 


Although first described several years ago, no type number has so 
far been assigned to this collagen. The main reasons for this have 
been, probably, a suspicion that this collagen fraction lacks any estab- 
lished molecular formula and consists of a mixture of several different 
homo- and heterotrimeric molecules, and the unknown significance of 
the apparent type II collagen gene product in the system. Indeed, in an 
earlier review Miller (1985) referred to the protein as type K collagen 
until its chemistry could be better resolved. 

Because a consensus of recent data now point to a primary hetero- 
trimeric parent molecule of composition 1a2a3a and establish, at the 
very least, that the 3a chain exists in heterotrimeric molecules with 
combinations of la and 2a chains, it now seems appropriate to give 
1a2a3qa collagen a type number. The next available number is type XI, 
which was already suggested for 1a2a3a collagen in a recent review of 
the collagen genes (Cheah, 1985). We propose, therefore, that the term 
type XI collagen be adopted. For consistency when using the new no- 
menclature, the la, 2a, and 3a chains will therefore become al(XD, 
a2(XD, and a3(XI), respectively. However, the original names will 
probably persist for a while and, indeed, are used in this article in most 
figures. It remains possible that type XI collagen includes native mole- 
cules with various combinations of al(XI), «2(XI), and a3(XI) chains, 
but this sort of complexity applies equally to the type V collagen sys- 
tem (Bornstein and Sage, 1980; Fessler et al., 1985; see Fessler and 
Fessler, this volume). Clearly, types V and XI collagens should be 
thought of as close relatives that, together, constitute a specialized 
subclass of the collagen family of proteins. 


III. ISOLATION AND ANALYSIS 


A. Pepsin Digestion and Salt Fractionation 


Originally, Burgeson and Hollister (1979) prepared type XI collagen 
from growing human (fetal and 4-year postnatal rib and femoral head) 
cartilages and from young adult bovine (18-month rib) cartilage. The 
homogenized tissue was extracted in 4 M guanidine HCl to remove 
proteoglycans. The washed residue was treated with pepsin (0.5-1.0 
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mg/ml) at 4°C for 24 hr, the digest neutralized to give a water-clear 
solution, then dialyzed against water; the precipitate that formed was 
sedimented and redissolved in 1 M NaCl at neutral pH. The type XI 
collagen was resolved from type II collagen by dialyzing this neutral 
solution against 0.1 M acetic acid at 4°C and centrifuging hard (6.2 x 
10% g-min) to recover type XI in the pellet (Burgeson and Hollister, 
1979; Burgeson et al., 1982). This procedure was adopted after initial 
experiments showed that the protein tended to be insoluble in dilute 
acetic acid. Later, others (Shimokomaki et al., 1980; Furuto and Miller, 
1983a; Sussman et al., 1984; Eyre et al., 1984b) have reported the 
isolation of type XI collagen by precipitating at 1.2 M NaCl directly 
from the acidic pepsin digest of cartilage, having removed type II 
collagen at 0.7-0.9 M NaCl. Milder centrifugation was used to clarify 
the digest initially and to sediment the type II collagen, e.g., 18,000 
rpm for 60 min in a Sorvall RC5C, SS-34 rotor (Eyre et al., 1984b). 
Possibly, the sedimentation of type XI from dilute acetic acid requires 
ultracentrifugation, perhaps needing or being aided by the presence of 
proteoglycan digestion products to which type XI may preferentially 
bind (Smith et al., 1985; see later). 


B. Chromatography and Electrophoresis 


Figure 1 shows the resolution by carboxymethylcellulose (CM-cel- 
lulose) column chromatography of the three a chains of type XI col- 
lagen prepared after pepsin digestion of neonatal human cartilage as 
reported by Burgeson et al. (1982). The «3(XI) chain coelutes with 
al(ID, the al(XD chain elutes just later than a1(V), and the «2(XI) 
chain comes last and later than all other a chains of the common 
collagen types, consistent with it being the most basic by amino acid 
composition (see Table II). 

Figure 2 shows how the chains migrate on SDS—polyacrylamide 
electrophoresis using the buffer system of Laemmli (1970). The a3(XD 
chain runs just above and merging with a1(II) in the position of «2(V), 
and the a1(XD chain runs in the position of a1(V). Despite being re- 
solved by SDS-electrophoresis, all three chains coeluted on agarose 
sieve chromatography in the position of a1(I), indicating that all are of 
similar length (Burgeson et al., 1982). 


C. Peptide Mapping 


Each «a chain gives a distinct profile of CNBr peptides on SDS- 
polyacrylamide electrophoresis (Burgeson and Hollister, 1979; Reese 
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Fic. 1. Elution profile of la [@al(XD], 2a [a2(XD], and 3a [a3(XD]| chains on CM- 
cellulose chromatography. The column was eluted at 60°C with a linear gradient of 0- 
0.1 M LiCl in 0.05 M Li acetate, pH 4.8, containing 1% (v/v) isopropanol. [Reproduced 
with permission from Burgeson et al. (1982)]. 


and Mayne, 1981; von der Mark et al., 1982; see Fig. 3) and CM- 
cellulose chromatography (Burgeson et al., 1982). The peptide profiles 
of «L(XI) and «2(XI) are unique, differing significantly from those of 
al(V) and a2(V), the chains to which «al(XI) and a2(XI) seem most 
closely related based on their other chemical properties. The «1(XI) 
chain was distinguished from a1(V) by giving one major peptide band 
(Fig. 3), which may be a doublet of peptides of about 400 residues each 
judging by their electrophoretic mobility (unpublished observations). 
All the CNBr peptides of the «2(XD) chain from the rat chondrosar- 
coma were isolated by HPLC molecular sieving and ion-exchange chro- 
matography, and their amino acid compositions determined (Furuto 
and Miller, 1983b). The belief that a1(XI), «2(XD), a1(V), and a2(V) are 
each distinct gene products was reinforced by the marked differences 
in their peptide maps on electrophoresis after digestion by sta- 
phylococcal V8 protease (Reese and Mayne, 1981; Burgeson et al., 
1982). The a3(XI) chain, however, gave a very similar CNBr peptide 
pattern to that of the al (II) chain, except that the major fragments of 
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Fic. 2. Resolution of la [al(XD], 2a [a2(XDJ, and 3a [«3(XD] chains by SDS-5% 
polyacrylamide slab gel electrophoresis. The collagen was extracted from calf growth 
cartilage by pepsin digestion. a, Type XI precipitated by 1.2 M NaCl at pH 3; b, type II 
collagen precipitated from the same digested by 0.7 M NaCl at pH 3. 


a3(XI) ran slower, suggesting they differed posttranslationally (Bur- 
geson and Hollister, 1979). Furuto and Miller (1983a) have isolated the 
individual CNBr peptides of 3a by column chromatography and com- 
pared them with their a1(ID counterparts. Amino acid analyses con- 
firmed that a3(XI) was richer in hydroxylysine and hydroxylysine 
glycosides than «al(II) prepared from the same tissue sample (Bur- 
geson and Hollister, 1979; Furuto and Miller, 1983a; also see Table II). 


D. Tissue Distribution 


Table I lists tissues in which type XI collagen has been found. It 
appears as a constant minor ingredient of all types of cartilage in 


Type | 
Type II 
Type Ill 
a2(V) 
ai(V) 


rn 077 ” 
+ + th th 
. e 
D vo 
mM 
+ - 


Fic. 3. Peptide profiles (CNBr) of the individual type XI collagen a-chains on SDS- 
10% polyacrylamide slab gel electrophoresis compared with other collagen chain types. 
[Adapted and reproduced with permission from Burgeson and Hollister (1979), Biochem. 
Biophys. Res. Commun. 87, 1124-1131.] 
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which type II collagen forms the main collagen framework. Its ratio to 
type II collagen (about 1:30 in adult cartilage) is similar to the ratio of 
type V to type I collagen presented by most noncartilaginous connec- 
tive tissues. An exception to this rule may be vitreous humor, which 
has been reported to contain 1a, 2a, and 3a chains in addition to the 
main collagen ingredient, type II collagen (Ayad and Weiss, 1984). 
Using similar tissue (2-year steer vitreous humor), type V collagen 
was found (in addition to type II collagen) and not la, 2a, and 3a 
chains (unpublished data). Thus, the anomalously fast electrophoretic 
mobility noted by Ayad and Weiss (1984) for their vitreous 2a may be 
explained by this chain being a2(V). 

Although type V collagen apparently can be expressed with type II 
collagen in tissues that lack type I collagen (vitreous and also articular 
cartilage, see below), the parallel situation of type XI collagen being 
expressed in a tissue that lacks type II collagen has not been observed. 
Fibrocartilage of the knee meniscus, in which type I collagen forms the 
main collagen framework with only traces of type II collagen, contains 
type V collagen with no type XI detected (Eyre and Wu, 1983). Type XI 
collagen is present in the intervertebral disc (Table I) and appears to 
be most concentrated where type II collagen predominates in the nu- 
cleus pulposus. 


E. Articular Cartilage 


A mixture of types V and XI collagens appears to be present in the 
1.2 M NaCl precipitated collagen fraction from adult bovine articular 
cartilage (Eyre, unpublished work). Moreover, the tissue samples were 
of normal hyaline cartilage, with no signs of fibrous degeneration or of 
type I collagen in the pepsin digests. On electrophoresis of the collagen 
fraction that precipitates at 1.2 M NaCl, pH3, from a pepsin digest of 
18-month to 2-year steer cartilage, the ratio of 1a to 2a was obviously 
greater than 1:1 and nearer to 2:1, whereas the ratio in such fractions 
from fetal human or fetal bovine cartilage appeared to be about 1:1. 
Moreover, on CM-cellulose chromatography of the 1.2 M NaCl fraction 
from adult bovine articular cartilage, the 1a peak was asymmetrical. 
The trailing shoulder gave a CNBr peptide profile of 1a and the lead- 
ing half of «1(V) (not shown). Using reverse phase HPLC, the sus- 
pected «1(V) and 1a chains could be fully resolved (Fig. 4), each giving 
a distinctive CNBr peptide profile (not shown). These HPLC/SDS- 
PAGE profiles of whole chains from articular cartilage also resolve a 
component that runs in the position of authentic a2(V) (Fig. 4). 

The above results, therefore, indicate that type XI and V collagens 
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Fic. 4. Reversed-phase chromatography and subsequent electrophoresis of the re- 
solved minor collagen a chains from mature bovine articular cartilage. (a) Elution 
profile of 200 ug of collagen (1.2 M NaCl, pH 3, precipitate from a pepsin digest) on an 
Aquapore RP-300 column (4.6 mm x 25 cm, Brownlee) using a composite gradient of 
acetonitrile:n-propanol (3:1 v/v) in aqueous 0.1% trifluoroacetic acid (Eyre and Wu, 
1983). (b) Collected 2-ml fractions from (a) were dried and then run in consecutive lanes 
of a SDS—5% polyacrylamide slab gel. This sample still contained type II collagen which 
coeluted from the column with 3a and overlapped with 3a on electrophoresis. 
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coexist in adult bovine articular cartilage, with type V apparently 
having accumulated with age after the neonatal period. Analyses of 
serial slices (200 um) taken from surface to deep cartilage showed no 
obvious enrichment of type V relative to type XI collagen at the sur- 
face or in any region of the tissue (unpublished results). Moreover, 
type II collagen was the only major collagen species present; no type I 
collagen was detected. 

These observations may have a bearing on the report by Rhodes and 
Miller (1978) that a type V-like fraction could be prepared from artic- 
ular cartilage which appeared to consist exclusively of a1(V) chains. It 
seems likely, also, that the collagen identified in cultures of rabbit 
articular chondrocytes and called X and Y chains (Cheung e¢ al., 1976; 
Benya et al., 1977), consisted of a mixture of type V and XI collagen a 
chains. 

By immunolocalization experiments, Evans et al. (1983) detected 
type V collagen in bovine nasal cartilage. Later, they withdrew this 
conclusion when they discovered that their anti-type V antiserum rec- 
ognized type VI collagen, which then was presumed to be the source of 
the observed immunoreactivity in cartilage (Ayad et al., 1984). Bio- 
chemical data, however, indicate that normal adult articular cartilage 
can contain both types V and VI collagens in addition to type XI, type 
IX, and the bulk type II collagen (Wu and Eyre, unpublished results). 


IV. STRUCTURE 


A. Amino Acid and Carbohydrate Composition 


Types V and XI collagens prepared after pepsin digestion share dis- 
tinctive features of amino acid composition (Table II). All four chains, 
al(XD, «2(XD, al(V), and a2(V), contain half the number of alanine 
residues and are much richer in hydroxylysine compared with the « 
chains of types I, II, and III collagens. In subtle details, «1(XD) matches 
al(V) and a2(XT) matches a2(V), the latter pair being richer in his- 
tidine and threonine. In contrast, the a3(XI) chain is indistinguishable 
from al(II) in amino acid composition, except in degrees of lysine 
hydroxylation and hydroxylysine glycosylation (Table II). 


B. Chain Composition of the Native Molecule 


Although growing evidence points to a principal native molecule of 
composition, ala2a3(XD, it is still possible that type XI collagen prep- 
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TABLE II 
AMINO AcıD AND HYDROXYLYSINE GLYCOSIDE COMPOSITIONS OF TYPE XI COLLAGEN 
CHAINS FROM ADULT BOVINE ARTICULAR CARTILAGE” 


«(XD a2(XD a3(XD 


(la) (2a) (3a) al (ID al(V)d a2(V)d 
3-Hyp 2 2 2 1-2 NDe NDe 
4-Hyp 100 96 104 104 107 109 
Asp 49 49 45 48 48 51 
Thr 20 25 21 21 19 26 
Ser 28 25 26 26 33 31 
Glu 101 94 88 86 91 84 
Pro 121 112 118 116 118 97 
Gly 326 334 332 333 334 341 
Ala 51 51 100 100 46 52 
Cys 0 0 0 0 0 0 
Val 22 23 17 18 23 24 
Met 8 8 7 10 8 11 
Ile T. 14 9 10 18 16 
Leu 36 38 25 25 38 35 
Tyr 3 4 2 2 1 2 
Phe 11 11 13 13 11 14 
Hyl 37 40 21 17 35 24 
Lys 19 17 15 18 20 18 
His 7 11 3 3 8 11 
Arg 42 46 52 50 44 50 

Molar ratio, 8.3 7.5 2.7 1.6 1.5° 2.5° 
glegalHyl/galHyl 

Glycosylation (per- 77 75 74 61 86° 58° 
centage) 


aD. R. Eyre, unpublished data. The chains of types XI and II collagens were purified 
by salt fractionation and CM-cellulose column chromatography from a single tissue 
sample. ND, Not determined. 

‘Data from Burgeson et al. (1982). 

¢Calculated from the glucose (glc) and galactose (gal) contents, whereas in columns 1~ 
4 the values are based on hydroxylysine glycosides in a base hydrolysate by ion-ex- 
change chromatography and ninhydrin color yield. 


arations include a mixture of various heterotrimeric combinations of 
chains such as (a1).02(XD), (a2)oa3(X1), a1,(XD ete. 

The following are the points of evidence for a heterotrimeric mole- 
cule that combines the three different chains. 


1. Highly purified preparations of type XI that are not contami- 
nated with type II or type V collagens (for example, from fetal car- 
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tilage) give a ratio of the three a chains of 1:1:1 by densitometry after 
SDS-polyacrylamide electrophoresis. 

2. Three ß bands of roughly equal prominence are seen in such 
preparations (see Fig. 2) that have mobilities consistent with their 
being the three possible heterodimers from intramolecular cross-link- 
ing in 1a2a3a molecules. However, monomeric native molecules were 
not isolated for this analysis, so the dimers could result from either 
intramolecular or intermolecular cross-linking. This evidence is there- 
fore suggestive, but not proof. It is clear, nevertheless, that ß dimers of 
3a are absent from pure type XI preparations; if present, they would 
be resolved on electrophoresis running close to the B1(ID) chain position 
(see Fig. 2). 

3. Attempts to fractionate potential subpopulations of native type 
XI molecules by differential salt precipitation or chromatography un- 
der nondenaturing conditions (Burgeson and Hollister, 1979, Ayad and 
Weiss, 1984) have not succeeded. 

4. Native type XI collagen molecules extracted from lathyritic chick 
cartilage consist of a1(XI), a2(XD, and «3(XD chains in equal amounts, 
all three of which are about 10,000—15,000 MW larger than the chains 
of the pepsin-treated protein. The peptide extensions are apparently the 
retained amino-propeptides (Broek, 1984; G. Balian, personal commu- 
nication). This is consistent with the native functioning form of type V 
collagen which appears to be a molecule retaining a portion of its 
propeptide sequences (Fessler et al., 1985; Broek et al., 1985). 

5. Finally, unpublished observations that type XI collagen forms 
only one species of segment long spacing (SLS) crystallite as judged by 
electron microscopy (S. Ayad and B. Brodsky, personal communica- 
tions) provide further evidence for a single molecular species of com- 
position ala2a3(XD. 


The evidence is still inconclusive whether «3(XI) and al(II) have 
identical sequences. Furuto and Miller (1983a) isolated all the 3a 
[@3(XD] CNBr peptides and could find no basic differences with o1(ID) 
in their chromatographic properties or amino acid compositions. 
Though data and opinions are, therefore, leaning toward posttransla- 
tional differences only, subtle distinguishing features of their CNBr 
peptide electrophoretic profiles remain puzzling and leave room for 
localized sequence differences (von der Mark et al., 1982; Eyre et al., 
1984b). Even if #3(XI) is a product of the al(ID gene, as seems likely, 
there remains the question of how some proal(II) chains are selected 
to associate with prow1(XI) and proa2(XT) chains to form type XI mole- 
cules. One explanation might be alternative splicing of the primary 
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transcript in the formation of functional mRNA. Perhaps, for exam- 
ple, the triple-helical body of the a3(XD chain is identical to that of the 
al(II) chain, but one or both extension sequences might be spliced 
differently for «l(II) and «3(XI), analogous to the alternative splicing 
that produces fibronectin protein variants from the single fibronectin 
gene (Tamkun et al., 1984). A firm answer, most probably, must await 
sequence data on 3a and al(II) from the same animal source at the 
protein level. Although protein sequence data on the bovine al(ID) 
chain showed signs of microheterogeneity consistent with a major and 
minor form (Butler et al., 1977), more recent gene cloning work indi- 
cates only one human copy of the «aI(II) gene (Sheffield and Upholt, 
1985; Sangiorgi et al., 1985), which implies that these sequence vari- 
ants were due to allelic differences. 

Neither protein sequences nor molecular genetic data have yet been 
reported for the al(XT) or «2(XD) collagen chains. 


C. Polymeric Form 


Based on its molecular size and conformation, type XI collagen fits 
best in the class of Group 1 fibril-forming collagens (Miller, 1985). The 
actual polymeric form and location of type XI in tissues is still un- 
known. For example, it remains to be established if it exists as banded 
fibrils in its own right similar to those formed by types I, II, and III 
collagens. The pepsin-solubilized native protein molecules can be in- 
duced to form banded thin fibrils (of 60—67 nm periodicity) in vitro 
(Smith et al., 1985; B. Doyle, personal communication) using phos- 
phate-buffered saline or low-ionic-strength phosphate and conditions 
similar to those needed to form banded fibrils from type V collagen 
(Chiang et al., 1980; Adachi and Hayashi, 1985). However, there is a 
growing suspicion that type V, and by implication type XI, collagens 
function in the extracellular matrix not as homopolymers in their own 
right but copolymerized within the fibrils that form the bulk fibrillar 
network, i.e., type V hybridized with type I and type XI hybridized 
with type II. Indirect evidence for such a codistribution of types I and 
V collagens in the extracellular collagen framework of cornea was 
presented by Linsenmayer et al. (1985). 


D. Cross-Linking 


Type XI collagen contains similar cross-linking amino acid residues 
(Eyre, 1987; Wu and Eyre, 1985; Eyre, unpublished) to those formed 
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on the hydroxylysine-aldehyde pathway (Eyre et al., 1984a). Thus, 
type XI collagen, isolated from a pepsin digest of adult bovine car- 
tilage, was treated with NaBH,, acid hydrolyzed, and the hydrolysate 
run on the cross-linking amino acid program of a Glenco amino acid 
analyzer (Eyre, 1987; unpublished results). Roughly equal amounts of 
dihydroxylysinonorleucine (DHLN, 0.38 mol per mol of collagen in 
leucine equivalents of ninhydrin color) and hydroxylysyl pyridinoline 
(HP, 0.37 mol per mol of collagen in leucine equivalents) were present. 
In contrast, type II collagen from the same digest contained mainly HP 
(1.5 mol/mol) with only a trace of dihydroxylysinonorleucine (0.04 
mol/mol). It would seem, therefore, that cross-linking in type XI col- 
lagen tends to stay at the stage of divalent reducible bonds, perhaps 
because retained propeptides inhibit further cross-linking maturation 
reactions. Nevertheless, this common basic mechanism of cross-link- 
ing further supports the notion that type XI collagen should be consid- 
ered a member of the same class of fibrillar collagens as types I, II, and 
III collagens, along with type V collagen (Eyre, 1987). Furthermore, it 
seems that as a general rule the chemical type of cross-linking exhib- 
ited by these fibril-forming collagens (i.e., whether predominantly 
based on lysine aldehydes or hydroxylysine aldehydes) tends to be 
tissue specific rather than collagen-type specific (Eyre, 1987). 


V. BIOSYNTHESIS 


Type XI procollagen chains have been identified as products of 
cultures of chondrocytes and cartilage tissue in several laboratories 
(Schmid and Conrad, 1982; Remington et al., 1983; Sussman et al., 
1982, 1984; Clark and Richards, 1985; Jimenez et al., 1986; Reginato et 
al., 1986). They are present as disulfide-bonded trimeric procollagen 
molecules, the individual proa chains of which resolve above proal (II) 
on SDS electrophoresis (Clark and Richards, 1985). In contrast to the 
pepsin-trimmed chains, the individual pro« chains appear slightly 
larger than the proe chains of type I collagen as judged by their elution 
position on agarose sieve chromatography (Clark and Richards, 1985). 
However, with current data it is not yet clear whether a single form of 
procollagen molecule, of composition proal,proa2,proa3(XI), is pre- 
sent. Further analyses of the kind designed to answer this question for 
type V procollagen molecules (Kumamoto and Fessler, 1980) should 
give definitive answers. 

It was noted that type XI procollagen, in contrast to type II pro- 
collagen, was preferentially associated with the cell layer rather than 
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being put out into the culture medium (Mayne et al., 1983; Clark and 
Richards, 1985; G. Balian, personal communication). This might be 
taken as evidence that type XI is concentrated and functions per- 
icellularly but could equally reflect differences in the solubility or 
aggregation properties of the two collagens under the culture condi- 
tions used. 

Chondrocytes can be induced to change their phenotype in culture 
under a variety of conditions to produce type I collagen instead of type 
II collagen. It has been shown that this switch in bulk collagen ex- 
pression is accompanied by a change in synthesis from type XI to type 
V collagen using chick chondrocytes under various culture conditions 
(Mayne et al., 1983, 1984; Yasui et al., 1986). 

Synthesis of type XI collagen chains has also been demonstrated in 
cultures of chondrocytes and whole tissue from rabbit ear elastic car- 
tilage (Madsen et al., 1984). 


VI. PROTEASE SUSCEPTIBILITY 


None of the chains of bovine type XI collagen is cleaved by synovial 
or gastric mucosal collagenase at 25 or 37°C under conditions that 
cleave types I, II, and III collagens into three-quarter TCA and one- 
quarter TCB fragments (Eyre et al., 1984b). One study had concluded 
that the 3a chain of chick type XI collagen was degraded by skin 
collagenase while the 1a and 2a chains were not (Liotta et al., 1982). 
This evidence of cleavage probably resulted from residual contamina- 
tion of the substrate by type II collagen, for no cleavage of a3(XI) was 
observed in other studies on pure preparations of type XI (Fig. 5) 
(Furuto and Miller, 1983a; Eyre et al., 1984b). Why the a3(XD chain 
fails to cleave is unknown. The reasons may be, for example, because it 
differs in amino acid sequence and so lacks the specific cleavage site 
present in «L(II), or because this sequence is inaccessible when folded 
in heterotrimeric molecules together with a1(XI) and «2(XI) chains. 
This issue goes back to the fundamental question, whether a3(XI) and 
al(II) are posttranslational variants of the identical gene product or 
differ in primary sequence. 

Unhydroxylated type XI collagen made by chondrocyte cultures is 
unusually sensitive to pepsin, trypsin, and chymotrypsin (Clark and 
Richards, 1987). Chondrocytes isolated from chick embryo sterna were 
incubated with 2,2’-dipyridyl to inhibit prolyl and lysyl hydroxylation. 
All three chains, a1(XI), «2(XI), and «3(XI), in the disulfide-bonded 
type XI precursor molecules were extensively degraded even at low 
temperatures (0-15°C) under conditions when type II precursor mole- 
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Fıc. 5. SDS-7.5% polyacrylamide electrophoresis of type XI collagen showing the 
failure of human synovial collagenase to cleave any of the chains. Type II and type XI 
were prepared by pepsin from young bovine cartilage. The 0.9 M NaCl fraction of type 
XI still contained type II molecules which were cleaved giving al(ID4 fragments, where- 
as no such cleavage is evident for the purer 1.2 M NaCl fraction. —, Control incubations 
with buffer alone; +, collagenase incubations. 


cules were not cleaved within the triple-helical domains of the un- 
hydroxylated molecules. The results imply that unhydroxylated type 
XI procollagen fails to adopt a stable triple helix under conditions that 
promote unhydroxylated type II collagen to do so. The physiological 
significance of this behavior is unknown. 
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Type XI collagen resembles type V collagen in its lack of susceptibil- 
ity to the animal collagenases that can cleave types I, II, and III col- 
lagens. It also resembles type V collagen in being cleaved in its a1(XI) 
and «2(XI) chains by thrombin at 36°C, but not at 33°C, and by a tumor 
metalloproteinase at 30°C that also degrades type V collagen at 25- 
30°C (Liotta et al., 1982). The susceptibility of the pepsin-soluble sub- 
strate to thrombin at 36°C reflects denaturation of the collagen sub- 
strate. The cleavage of native type XI chains into specific large frag- 
ments by a metalloproteinase that has a similar activity against 
native type V collagen molecules (Liotta et al., 1982) invites com- 
parison to the specificity of the collagenases that degrade types I, II, 
and III collagens. However, only the a1(XI) and a2(XD chains (1a and 
2a, not 3a) seemed to be cleaved (Liotta et al., 1982), though con- 
taminating type II collagen may have obscured any 3a cleavage in 
these experiments. 

In summary, it seems unclear whether any of the type V-degrading 
metalloproteinases are collagen-type specific, in the manner of the 
known collagenases, or represent nonspecific gelatinases (Mainardi et 
al., 1980; Murphy et al., 1985) that can attack partially unfolded re- 
gions of types V and XI collagen molecules at 25-30°C. On the other 
hand, failure to be cleaved by specific collagenases that degrade types 
I, II, and III collagens might be a desirable biological feature in types 
V and XI collagens. 


VII. IMMUNOCHEMISTRY 


An antiserum raised in rabbits against fetal bovine type XI collagen 
(pepsin digested, 1.2 M NaCl, pH 3 precipitate) showed staining by 
indirect immunofluorescence concentrated as a diffuse halo only 
around chondrocyte lacunae in epiphyseal cartilage (Ricard-Blum et 
al., 1982). Similarly, antibodies raised against bovine type XI collagen 
in guinea pigs diffusely stained the territorial matrix around the 
chondrocytes of bovine nasal cartilage (Evans et al., 1983). Although 
the latter antiserum was purified by serial absorptions on affinity 
columns prepared from types I, II, III, IV, V, and IX collagens, type VI 
collagen at that time was unavailable. This is important as a later 
report (Ayad et al., 1984) noted that an anti-type V collagen antiserum 
was strongly positive against type VI collagen and so raises similar 
questions on the specificity of anti-type XI polyclonal antibodies since 
type VI collagen is widely distributed. 

The molecule isolated from cartilage, and thought to consist of 
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a1,(V), was localized pericellulary using immunofluorescence and a 
polyclonal antibody to a1(V) (Gay et al., 1981). 

When type XI collagen was injected into rats, it induced an arthritis 
in one strain (Alderley Park) similar to that induced by type II col- 
lagen (Morgan et al., 1983). In Sprague—Dawley rats, however, only 
type II collagen induced arthritis, though both collagen types were 
immunogenic in both rat strains. 

In unpublished studies (A. R. Poole and D. R. Eyre), rabbit anti- 
bodies raised against bovine type XI collagen reacted strongly with 
al(XD and a2(XD chains on immunoblotting after SDS—polyacryl- 
amide electrophoresis. This was evident even after absorbing the anti- 
serum against types II and V collagens, showing that al(XD and 
a2(XI) contain epitopes not present on types II and V collagens and 
providing further evidence that these chains are genetically distinct 
from the type V collagen chains. However, it was not possible with this 
antiserum to absorb all reactivity against type IX collagen, which by 
immunoblotting after SDS electrophoresis could be detected as a con- 
taminant of the type XI antigen. 


VIII. PATHOLOGY 


Among normal tissues, type XI collagen has been found only in 
those that contain type II collagen as a major matrix constituent. How- 
ever, type XI was found recently in the abnormal cartilage of a child 
who died at birth with a very rare heritable dwarfing syndrome, the 
Langer-Saldino variant of achondrogenesis (Eyre et al., 1986). No 
type II collagen could be detected in any cartilaginous tissue of this 
infant, despite types XI and IX collagens making up about 10% each of 
the total collagen (Eyre et al., 1986; unpublished results) and type I 
collagen replacing type II collagen. In the future, such cases might 
give insight on the biological function of the minor cartilage collagens 
and help decide whether the «3(XT) and al(ID chains are identical 
gene products or, perhaps, the result of alternative splicing of mRNA. 


IX. POSSIBLE FUNCTIONS 


Although no function has been described for type XI collagen, some 
suspected functions include (1) regulating the diameter of type II col- 
lagen fibrils; (2) being a specific building block of the perichondrocyte 
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basket of fibrils; and (3) mediating collagen-proteoglycan interac- 
tions. 

The possibility that type XI collagen is involved in regulating fibril 
diameters is supported by the finding that type XI can be induced to 
form only very thin fibrils in vitro. This was observed using the pepsin- 
solubilized protein (Smith et al., 1985) or the intact protein with re- 
tained N-propeptides that was extracted by salt from lathyritic tissues 
(Broek, 1984). Therefore, if type XI collagen is naturally distributed 
within the same fibrils as type II collagen, it may be able to control the 
diameters of the hybrid polymer, either by forming a coating layer on 
the surface of the fibril or through its quantitative ratio to type II 
collagen in the fibril. 

The notion that type XI collagen may be restricted to the pericellular 
domain rests on immunofluorescence light microscopy using poly- 
clonal antibodies to type XI collagen (see above) and by extrapolation 
of similar early findings with antibodies to type V collagen (Gay et al., 
1981). For type V collagen, at least, this concept of a restricted per- 
icellular location seems to be changing. Data which are accumulating 
using very well characterized monoclonal antibodies to chick type V 
collagen show a uniform distribution throughout the extracellular ma- 
trix (Linsenmayer et al., 1985). 

Smith et al. (1985) have provided evidence that type XI collagen, in 
the form of native molecules in solution or reconstituted fibrils, exhib- 
its a firm binding to cartilage proteoglycans. The binding was rela- 
tively nonspecific in that free sulfated polysaccharides, including 
chondroitin sulfate, heparin and dextran sulfate, competed very effec- 
tively in the binding assay. However, the binding was specific in that, 
under the same conditions, type II collagen did not bind at all. It was 
speculated that such binding might function in vivo to anchor the 
collagen in its fibrillar form to cartilage matrix proteoglycans or, per- 
haps, to sulfated glycosaminoglycans on the cell membrane. It should 
also be considered that if type XI collagen in vivo is uniformly dis- 
tributed throughout the matrix as a copolymer within the same fibrils 
as type II collagen, then one of its functions may be to mediate phys- 
ical interactions between the fibrils and proteoglycans. 
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I. INTRODUCTION 


The isolation and characterization of cDNA and genomic DNA 
clones for collagenous molecules are essential for understanding how 
collagen genes are regulated. In addition, nucleotide sequence analysis 
of DNA provides a powerful approach to primary structure analysis of 
collagens. In this article, we will discuss how molecular biology tech- 
niques are being used to generate structural information about col- 
lagen types. We will first describe some of the general strategies for 
constructing cDNA and genomic DNA clones. This will be followed by 
a discussion of how such clones are identified. Finally, we will exam- 
ine the kinds of structural information that can be easily obtained 
from cloned DNA, illustrated by some recent examples from our own 
studies. 


ll. CLONING STRATEGIES 
A. cDNAs 


1. PURIFICATION AND CHARACTERIZATION OF 
mRNA [poly(A)+ RNA] 


Synthesis of cDNA requires isolation of mRNA. Isolation of intact 
mRNA that is enriched for a specific mRNA of interest is probably the 
* Present address: Dana-Farber Cancer Institute, Boston, Massachusetts 02115. 
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most critical step in a cDNA cloning project and, therefore, warrants 
considerable attention. Subsequent steps, such as cDNA synthesis, 
construction of recombinant plasmid (or phage), transformation, and 
cloning, are now quite routine and can be performed readily with the 
use of commercial kits. The successful isolation of long cDNAs, howev- 
er, depends critically on high quality, well-characterized RNA. Extra 
weeks spent on isolation of RNA could well save months of subsequent 
cloning work. 

Several methods are available for the isolation of RNA. A popular 
method is extraction with SDS in the presence of proteinase K (Bur- 
nett and Rosenbloom, 1979; see also Maniatis et al., 1982). This has 
been used by several investigators for the isolation of RNA from both 
cultured cells and tissues. The method has been used in our own labo- 
ratory with chick calvarial bone tissue with good results. However, we 
did not find the method satisfactory for cartilage tissues. Extraction 
with guanidine hydrochloride (Adams et al., 1977) gave better yields of 
high-molecular-weight (and presumably intact) RNA from chick em- 
bryo sternal cartilage than did the proteinase K technique. The best 
method for collagen mRNA extraction, however, is the use of 
guanidine thiocyanate (Chirgwin et al., 1979). We now use this method 
almost exclusively to isolate high-molecular-weight RNA in high 
yields with a minimum of degradation from a variety of tissues. 

Having obtained total cellular or tissue RNA by one of the above 
methods, the enrichment of poly(A)' RNAs (mRNAs) over rRNAs is 
accomplished by oligo(dT) cellulose (see Maniatis et al., 1982) or paper 
affinity chromatography [messenger affinity paper (mAP)] (Werner et 
al., 1984). If size fractionation of mRNAs is desired, one may use a 5— 
20% sucrose gradient with a denaturant such as methyl mercury hy- 
droxide (Payvar and Schimke, 1979; Myers et al., 1981). Or, as in the 
case of al(I) and a2(I) collagen mRNAs, size fractionation can be ac- 
complished using Sepharose-4B chromatography (Frischauf et al., 
1978). 

Characterizing the RNA before it is used for cDNA synthesis is as 
important as choosing the best method for its extraction from cells and 
tissues. Cell-free translation (Pelham and Jackson, 1976) with a com- 
mercially available lysate and analysis of the translation products by 
SDS-polyacrylamide gel electrophoresis is now relatively simple and 
provides important information about the complexity of the mRNA. 
The use of highly purified bacterial collagenase provides, of course, a 
convenient probe for collagenous polypeptides. 

The characterization of mRNA by cell-free translation analysis can 
also provide data that are essential for selecting an efficient method 
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for screening cDNAs constructed from the mRNA. For example, the 
strategy used for the recent cloning of type X collagen cDNA in our 
laboratory (Ninomiya et al., 1986) was based on cell-free translation 
analysis of RNA isolated from long-term cultured chondrocytes and 
sternal cartilage. Gel electrophoresis of translation products synthe- 
sized with chondrocyte RNA showed a prominent, collagenase-sen- 
sitive band comigrating with the bovine serum albumin molecular- 
weight maker, while products synthesized with RNA from 17-day-old 
chick embryo sternal cartilage only showed a very weak band (Fig. 
1A). On this basis, we constructed a cDNA library from the chondro- 
cyte RNA and screened the library using a differential screening 
method (Ninomiya et al., 1984a). Clones were hybridized with alkali- 
fragmented, 32P-labeled chondrocyte RNA (positive probe) and alkali- 
fragmented, 32P-labeled sternal cartilage RNA (negative probe). One 
of the clones that hybridized strongly to chondrocyte RNA, but did not 
hybridize to sternal cartilage RNA, was selected and shown to code for 
type X collagen (Ninomiya et al., 1986; Fig. 1B). 

Finally, the quality of the RNA may be assessed by Northern blot 
analysis using an existing collagen DNA probe to probe the nitro- 
cellulose blot. For example, in analyzing cartilage mRNA, we have 
found it useful to examine whether type II collagen mRNA is intact or 
degraded. Usually, extensive degradation of one mRNA (type ID, indi- 
cated by a smear of hybridization in a Northern blot, also indicates 
that other mRNAs are degraded. 


2. cDNA SYNTHESIS 


A variety of methods are available for synthesis and cloning of 
cDNA with poly(A)+ RNA as a template. These are described in detail 
in several excellent handbooks and laboratory manuals (Maniatis et 
al., 1982; Perbal, 1984; Glover, 1985; see also Grossman and Moldave, 
1980; Wu, 1979; Wu et al., 1983a,b). In addition, good commercial kits 
are available containing all the necessary enzymes and buffers as well 
as detailed protocols. Since cloning vectors and transformation-compe- 
tent host cells also can be purchased, the whole process of cDNA syn- 
thesis and cloning is becoming rather routine. In fact, some companies 
even supply ready-made cDNA libraries from a variety of sources, and 
it is likely that such libraries will soon become available for an in- 
creasing number of tissues and animal species. 

There are, however, several basic components to the overall pro- 
cedure of cDNA synthesis, and these will be briefly presented. In- 
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cluded are the use of avian myeloblastosis virus (AMV) (Efstratiadis 
et al., 1976) or Moloney murine leukemia virus (M-MLV) reverse 
transcriptase for first strand cDNA synthesis, followed by Escherich- 
ia coli DNA polymerase I (Efstratiadis et al., 1976; Lehrach et al., 
1978), Klenow fragment (Yamamoto et al., 1980), or reverse tran- 
scriptase (Myers et al., 1981) for second strand synthesis. The prim- 
ing of second strand synthesis is accomplished by either the hairpin 
loop that naturally occurs at the 3’ end of the cDNA after first strand 
synthesis or by alkali degradation of the RNA template followed by 
dC-tailing of the first strand and hybridization of an oligo(dG) primer 
to that tail (Land et al., 1981). 

After synthesis of the second strand, S1 nuclease or mung bean 
nuclease is added to remove the hairpin primer loop as well as to create 
blunt ends at the termini of the double-stranded cDNA. If an oligo(dG) 
primer was used for second-strand synthesis, the termini of the cDNA 
can be made flush either with S1 nuclease or by a reaction with E. coli 
DNA polymerase I or T4 DNA polymerase. 

The next step in the cloning process is to create “sticky” termini at 
the ends of the cDNA so that it may be conveniently cloned into an 
appropriate vector. For example, homopolymer tails of dC may be add- 
ed to the cDNA with terminal transferase, allowing it to be cloned into 
the dG-tailed PstI site of pBR322 (Villa-Komaroff et al., 1978). Liga- 
tion of synthetic linkers containing sites for restriction endonucleases, 
such as HindIII or EcoRI, to the cDNA and cloning into the same site of 
pBR322 is another alternative (see Maniatis et al., 1982). 

In order to increase the percentage of cloned high-molecular-weight 
cDNAs, size fractionation of the cDNA population can be performed 
either before or after the creation of sticky ends. One of these methods 
involves the use of a 5-20% neutral sucrose gradient (Lehrach et al., 
1978). Other methods include size fractionation by chromatography on 


Fic. 1. (A) Polyacrylamide gel electrophoresis of cell-free translation products synthe- 
sized with mRNA isolated from long-term cultured chick tibiotarsal hypertrophic 
chondrocytes (lanes 1 and 2) and 17-day-old chick embryo sterna (lanes 3 and 4). Arrow 
indicates the position of type X collagen translation products. The arrowhead indicates a 
second collagenase-sensitive polypeptide, whose identity is presently unknown, migrat- 
ing slightly faster than the Type X band. For the cell-free translation analysis a proline- 
depleted commercial reticulocyte lysate (N150, Amersham) was used with [?Hlproline. 
-, Without bacterial collagenase; +, with bacterial collagenase. (B) Analysis of cDNA 
clones by dot hybridization using %2P-labeled RNA from long-term cultured chondro- 
cytes (a) and 17-day chick embryonic sterna (b) as probes. The arrows indicate a clone 
that hybridized strongly to chondrocyte RNA but only weakly to sternal cartilage RNA. 
Reproduced with permission from Ninomiya et al. (1986). 
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a Bio Gel A-15m column (Wang and Gudas, 1983) or agarose gel 
electrophoresis. 

A variation on the priming of first-strand cDNA synthesis with 
oligo(dT) is the use of either a synthetic oligonucleotide or a short gene 
fragment as a primer. This method is feasible only when the amino 
acid or exon sequence is already known and is most often used to 
ascertain the signal and 5’-untranslated sequence of the mRNA. In 
addition, if the primer is end labeled with [y-32P]ATP and poly- 
nucleotide kinase, the products of first-strand synthesis can be elec- 
trophoresed in a denaturing polyacrylamide gel, purified by elec- 
troelution, and subjected to nucleic acid sequence analysis. In this 
manner the transcription initiation site of the chicken a2(I) collagen 
gene (Vogeli et al., 1981a) as well as the nucleotide sequence of the 5’- 
untranslated region and signal sequence of the chicken a1(I) (Yamada 
et al., 1983a) and al (III) (Yamada et al., 1983a) mRNAs were deter- 
mined. Also, it is possible to clone the primer-extended products as was 
demonstrated with the chicken al(I) (Yamada et al., 1983a) and rat 
all) (Kohno et al., 1984) cDNAs. 

In a method developed by Okayama and Berg (1982) for high-effi- 
ciency cloning of full-length, double-stranded cDNA, the oligo(dT) 
primer is part of the plasmid vector (see Fig. 2; Ninomiya et al., 1984a). 
This primer-vector is annealed to poly(A) RNA and the first cDNA 
strand is synthesized with reverse transcriptase. Terminal transferase 
is utilized to generate an oligo(dC) tail on the 3’ end of the first strand 
while it is still hydrogen-bonded to the mRNA. In the next step, the 
vector is restricted with HindIII, and a short, double-stranded linker 
DNA (with a HindIII sticky site at one end and an oligo(dG) tail at the 
other) is annealed to the vector-cDNA—mRNA complex so that a cir- 
cular molecule is formed. In a final step, the mRNA strand is replaced 
by DNA in a reaction that utilizes RNase H (to remove the mRNA), 
DNA polymerase I (to repair the second cDNA strand by nick-transla- 
tion), and DNA ligase (to close the circular molecule). We have utilized 
the Okayama-Berg method extensively for the cloning of cDNAs of 
cartilage collagens (Ninomiya et al., 1984b, 1985, 1986; Ninomiya and 
Olsen, 1984), and the method has worked well in our hands. However, 
the method is somewhat tedious and involves several steps. Therefore, 
we have recently used a rapid modification (Gubler and Hoffman, 
1983) that includes priming with oligo(dT), first-strand synthesis with 
reverse transcriptase, second-strand synthesis with RNase H, DNA 
polymerase I, and DNA ligase as in the Okayama-Berg method, fol- 
lowed by tailing of the cDNA with oligo(dC) and annealing to PstI-cut 
and oligo(dG)-tailed pBR322 (commercially available). 
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Fic. 2. Diagram illustrating the use of the restriction endonuclease Sau96I for 
screening of collagen cDNAs. The endonuclease cuts within sequences that code for Gly- 
Pro sequences and produces oligonucleotide fragments that are multiples of 9 bp in 
length. The fragments produce a characteristic ladder on 10% polyacrylamide gels. 


B. Genomic Libraries 


1. PURIFICATION OF HIGH-MOLECULAR-WEIGHT DNA 


The purification of high-molecular-weight (HMW) DNA of greater 
than 100 kb is essential for construction ofa genomic library. Only in 
this manner is one insured of having a complete representation of 
clonable restriction enzyme fragments after partial enzyme digestion. 
Methods for preparation of HMW genomic DNA from cells or tissue 
are available (see Perbal, 1984; Kaiser and Murray, 1985). 
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2. LIBRARY CONSTRUCTION 


After HMW DNA preparation, an appropriate cloning vector must 
be selected. Vectors derived from bacteriophage lambda are widely 
employed, but unfortunately have a cloning capacity of only up to 20 
kb. After partial restriction enzyme digestion, genomic DNAs corre- 
sponding to approximately 15-20 kb are size fractionated on a 10-40% 
sucrose gradient (see Kaiser and Murray, 1985). DNAs purified in this 
manner are then ligated to commercially available phage vector arms, 
packaged into infectious phage particles using a commercially avail- 
able kit, and used to infect an appropriate host cell. The phage titers of 
the pooled packaging reactions should yield a total number of plaque- 
forming units (PFUs) which roughly correspond to at least three times 
the number of clones necessary to generate one genome equivalent of 
DNA. Subsequent amplification of the gene library allows multiple 
screenings to be performed using a variety of gene probes. 

The use of bacteriophage lambda as a cloning vector is based on the 
fact that, although the linear DNA molecule is 48.6 kb long, only 60% 
of the lambda genome is needed for propagation of the phage (see 
Maniatis et al., 1982). Thus, the central region is dispensable and can 
be replaced by non-lambda DNA. Such replacement vectors are now 
available in a variety of versions. The original Charon phages, such as 
Charon 4A (Blattner et al., 1977) and Charon 30 (Rimm et al., 1980), 
are still being used (we have used them for the construction of chicken 
genomic libraries) but are increasingly being replaced by vectors that 
contain “polylinkers” with multiple restriction endonuclease sites. In 
addition, replacement vectors, such as EMBL3 and EMBL4 contain a 
functional gam gene in the central, dispensable “stuffer” fragment 
(see Kaiser and Murray, 1985). This makes the parental phage vector 
unable to form plaques on E. coli strains lysogenic for phage P2. 
Therefore, after cutting out the stuffer fragment with a restriction 
endonuclease, it is unnecessary with these vectors to physically re- 
move the fragment from vector arms before they are ligated to the 
exogenous DNA. By growing the library in an E. coli (P2) host, only 
recombinant phages will grow. 

The use of cosmid vectors (Collins and Hohn, 1978) in library con- 
struction was a significant advance in gene isolation technology be- 
cause it allowed the isolation of DNA segments of up to 45 kb in size. 
This provides for a reduction in the number of steps required to walk 
along a chromosome. In addition, with the larger insert size, cosmids 
can incorporate particularly long eukaryotic genes (such as collagen 
genes) within a single insert fragment (Cheah et al., 1985). Cosmids 
combine the drug selectivity of plasmids, and their ability to replicate 
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autonomously within a bacterial host, with the packageability and 
infectivity of bacteriophage lambda. Therefore, once the host cell is 
infected by the phage particle, the cosmid will function entirely as a 
plasmid, and bacterial colonies will be generated. High-density screen- 
ing of the clones can be performed as outlined by Hanahan and Mes- 
elson (1980). However, colony hybridization for cosmids is less effi- 
cient than plaque hybridization for phage. This is, in part, due to the 
fact that plasmid copy number is reduced when the size is increased. A 
second disadvantage of cosmids is their tendency to lose insert se- 
quences by deletion. Obviously, since the copy number increases as the 
insert size is reduced, deleted cosmids outgrow the larger copies. 


III. STRATEGIES FOR IDENTIFYING COLLAGEN 
cDNA AND GENOMIC CLONES 


A. Use of Cross-Hybridizing cDNA and 
Genomic DNA Probes 


Thus, while the synthesis and cloning of cDNA is becoming easier, 
the screening for a specific clone remains a time-consuming effort 
whose successful outcome is critically dependent on the screening 
method used. Therefore, we will discuss in some detail the different 
approaches one can take for screening of collagen cDNA clones. Some 
of the methods can also be used for genomic DNA screening, and we 
will point that out where it is appropriate. 

As soon as cDNAs specific for chicken type I collagen chains became 
available, they were used as cross-hybridizing probes to isolate other 
collagens from genomic libraries. For example, chicken a1(I) and «2(I) 
cDNAs were used to isolate collagen genes from Caenorhabditis ele- 
gans (Kramer et al., 1982) and Drosophila melanogaster (Monson et al., 
1982) as well as from the human al(II) (Cheah et al., 1985) and the 
mouse a1(I) collagen genes (Monson and McCarthy, 1981). Conversely, 
genomic probes can be used in cross-hybridization experiments to iso- 
late cDNA clones, as was demonstrated by Loidl et al. (1984), when a 
human a@l(III) cDNA was identified by cross-hybridization with a 
chick a1(III) gene probe. 

Clearly, the use of cross-hybridizing probes is a preferred method 
when a probe that shows good cross-hybridization with the sequence of 
interest is available. Large numbers of cDNA or genomic DNA clones 
can be easily screened with high-density colony (Hanahan and Mes- 
elson, 1980) or plaque hybridization (Benton and Davis, 1977). This 
approach is limited, however, by the degree of cross-hybridization with 
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the sequence of interest. In addition, due to the large number of homol- 
ogous collagen genes, unwanted cross-hybridizations between differ- 
ent collagen types and other GC-rich sequences can occur, especially 
when screening across species boundaries. Finally, the genes of some 
of the more recently discovered collagens, such as types IX and X, 
show a codon usage distinctly different from that of fibrillar collagen 
genes (even in the same species). As a consequence, cDNA and genomic 
probes for fibrillar collagens did not show hybridization with types IX 
and X DNA and could not, therefore, be used to screen for types IX and 
X collagen cDNAs. 

Obviously, as the number of DNA probes for specific collagens in- 
creases, the likelihood of finding a probe that will detect additional 
collagen types by cross-hybridization will also increase. Therefore, one 
can expect that cDNA and genomic screenings for additional collagen 
types will become easier. 


B. Use of Synthetic Oligonucleotide Probes 


Another method of identification of collagen-containing cDNA and 
genomic clones involves the use of synthetic oligonucleotide probes 
(see Szostak et al., 1979). In these cases the amino acid sequence of a 
peptide has already been determined, while the corresponding DNA 
sequence is unknown (Suggs et al., 1981; Goldberg et al., 1986). Due to 
the degeneracy of the genetic code, a mixture of oligonucleotides must 
be synthesized representing all possible DNA sequences. For example, 
Kohno et al. (1984) used a mixture of 16 tetradecamers, whose se- 
quence was based on the amino acid sequence Phe-Asp-Gln-Lys-Ala 
from the N-telopeptide of rat proal(II) collagen, to both synthesize 
cDNA and screen those cDNAs for a proal(ID) collagen cDNA. 


C. Use of Restriction Endonucleases That 
Cut at Repetitive Codons 


In situations where cross-hybridizing probes or protein data are not 
available, we have successfully used a restriction endonuclease meth- 
od to screen for cDNAs encoding collagenous proteins (Ninomiya and 
Olsen, 1984; Ninomiya et al., 1984a, 1985). The method is based on the 
fact that collagens contain Gly-X-Y amino acid repeats, with X fre- 
quently being a prolyl residue. Therefore, collagen cDNAs will contain 
a repeat of the nucleotide sequence GGx-xxx-yyy, in which x and y can 
be any nucleotide. Whenever an amino acid triplet is Gly-Pro-Y, the 
corresponding nucleotide sequence is GGx-CCx-yyy. This nucleotide 
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sequence contains the recognition site GGxCC for the restriction endo- 
nuclease Sau96l (Fig. 2). Within a cDNA that codes for a collagenous 
sequence, one is therefore likely to find multiple Sau96I sites, and the 
distances between such sites will be multiples of nine nucleotides. By 
digesting collagen cDNAs with Sau96l, end labeling the products 
using the Klenow fragment of DNA polymerase I to fill in the 5’ 
overhangs (created by Sau96I) with [a-?2P]dGTP, a characteristic lad- 
der of bands representing fragments that are multiples of 9 bp in 
length can be displayed by electrophoresis in 10% polyacrylamide gels 
(Fig. 3). We have used this approach to screen cDNAs from libraries 
made with both chondrocyte and fibroblast mRMA. For screening, 
plasmid DNA is isolated from 1 or 1.5-ml cultures of bacterial clones 
picked at random from cDNA library plates. The DNA is digested with 
Sau$96l, labeled with 32P and electrophoresed on 10% polyacrylamide 
gels. Within a few weeks, several hundred clones can be screened by 
this method. 

The method has the advantage of only detecting cDNAs that extend 
far enough from the 3’ end of the mRNA of a collagenous molecule to 
cover a substantial part of the triple-helical coding region. Also, since 
the recognition site for Sau96I, GGxCC, does not place any restrictions 
on the nucleotide in the third base positions of the glycyl and prolyl 
codons, the method allows detection of collagen DNAs that have a 
codon usage substantially different from those of collagens already 
available. The usefulness of the method is illustrated by the successful 
isolation of cDNAs encoding the al and «2 chains of type IX collagen 
(Ninomiya and Olsen, 1984; Ninomiya et al., 1985), and cDNA for a 
novel type IX-like collagen expressed by chick tendon fibroblasts (Gor- 
don et al., 1987). 

The major disadvantage of the restriction endonuclease method is 
that it is relatively slow and labor intensive as compared with colony 
and plaque hybridization methods. A second disadvantage is that once 
a cDNA clone is identified by this method as being collagenous, fur- 
ther characterization (Northern blot analysis, nucleotide sequencing) 
is necessary to establish the identity of the clones. However, as a 
method for detection of cDNAs encoding novel collagens, we consider 
it extremely valuable. 


D. Use of Antibodies for Screening 
of Expression Libraries 


While the restriction endonuclease screening method permits the 
isolation of collagen cDNAs even before isolation and characterization 
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Fic. 3. Polyacrylamide gel electrophoresis of fragments produced by digesting several 
DNAs with Sau96l (courtesy of Dr. Tomoatsu Kimura). DNAs were prepared from 
plasmid “minipreps” digested with Sau96I and end labeled with [a-?2P]dGTP and the 
Klenow fragment of DNA polymerase I, before electrophoresis in a 10% gel. Samples in 
lanes 1 and 20 are standards (type X collagen) showing characteristic fragments of 18, 
27, 36, 45, and 54 base pairs. Samples in lanes 7 and 17 are unknown cDNAs that show a 
clear collagenous ladder. The conditions for endonuclease digestion and labeling are as 
follows: (1) To 10 yl total volume of 20 mM Tris-HCl, pH 7.4, 5 mM MgCl, 1 mM 
dithiothreitol (DTT), 50 mM NaCl (restriction buffer), add plasmid DNA and 1.4 U of 
Sau96I (Bethesda Research Laboratories, Inc.). The amount of DNA per reaction tube is 
equivalent to DNA prepared from 70 ul of a bacterial culture. (2) Incubate at 37°C for 60 
min. (3) Add 1 wl of “Klenow-mix.” The Klenow-mix consists of 1.5 wl (15 pCi) of 
[x-82P]dGTP (NEN, 800 Ci/mmol), 2.0 ul Klenow-fragment (BRL, 3 U/l), and 20 wl 
restriction buffer. (4) Incubate at room temperature for 15 min. (5) Add 1.5 ul of “termi- 
nation buffer” (40% glycerol, 0.2% bromphenol blue, 0.1 M Na4-EDTA). (6) Half of the 
sample is used for electrophoresis. 
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of the protein, and therefore usually long before antibodies are avail- 
able, the use of expression vectors is an alternative in those cases 
where an antibody is available. The identification and isolation of 
gene sequences by the use of specific antibody probes have been de- 
scribed by several groups (see Clarke et al., 1979; Ehrlich et al., 1979; 
Helfman et al., 1985). However, it was not until development of the 
Agtl1 vector system that immunological screening of expressing li- 
braries became a widely popular and successful method for cDNA 
isolation (Young and Davis, 1983, 1985). The approach has led to the 
isolation of cDNAs for several noncollagenous extracellular matrix 
molecules (Schwarzbauer et al., 1983; Doege et al., 1986; Yamada et al., 
1985) but has not yet been successful for collagens. 

The Agt11 expression vector permits insertion of foreign DNA into 
the B-galactosidase structural gene under the control of the lac oper- 
ator (Fig. 4). Propagation of the DNA occurs lytically with expression 
of the foreign DNA repressed by lac repressor. Addition of isopropyl-ß- 
D-thiogalactoside (IPTG) induces production of fusion polypeptides 
containing sequences encoded by the foreign DNA, and these can be 
detected immunologically. Rapid degradation of foreign polypeptides 
in the E. coli host is prevented by using a host cell carrying the lon 
mutation. For cloning of cDNAs encoding well-characterized proteins 
against which antibodies are available, the Agt11 system is clearly the 
method of choice. 


E. Hybrid-Select Translation Analysis 


Identification of cDNA or genomic DNA clones can, in some cases, 
be accomplished by hybrid-select translation analysis. For example, 
Wang and Gudas (1983) used hybrid-select translation to identify 
cDNA clones for type IV collagen and laminin. In the most widely used 
form of the selection procedure, DNA is immobilized on nitrocellulose 
and mRNA is hybridized to the filter (Riccardi et al., 1979). After 
removal of nonhybridized RNA by washing, bound RNA is eluted and 
analyzed by cell-free translation. When combined with immunopreci- 
pitation of the cell-free translation products, the approach is a good 
strategy for identifying cloned DNA. In our experience, however, 
hybrid-select translation is a difficult procedure, and we do not recom- 
mend it as a routine procedure for collagenous proteins. 


F. Identification of Exons in Genomic DNA 


Ideally, the synthesis and cloning of full-length cDNAs for any col- 
lagen mRNA should allow derivation of the primary structure of the 
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Fic. 4. Diagram illustrating the basis for expression cloning using the Agt11 system. 
The Agt11 vector allows insertion of cDNA within the ß-galactosidase gene. By growing 
the recombinant phage in a host cell (Y1090) that produces Jac repressor, synthesis of 
fusion polypeptides is repressed until induced by overlaying phage plaques with nitro- 
cellulose soaked in IPTG. The host cell produces a suppressor tRNA that suppresses a 
phage mutation causing defective lysis (S100). To prepare large amounts of a polypep- 
tide specified by a cDNA, the \gt11 recombinant can be expressed as a lysogen in E. coli 
Y1089. This host produces a temperature-sensitive lambda repressor (active at 37°C, 
inactive at 41°C), it carries a deficiency in the lon protease, and contains a mutation 
which enhances the frequency of phage lysogeny. In addition, Y1089 does not produce 
suppressor tRNA, so it is not lysed by Agt11. Lysogens (colonies that grow at 32°C, but 
not at 42°C) are picked, amplified, induced with IPTG, harvested, and lysed by freezing 
and thawing to produce fusion polypeptides in large amounts. 


FUSION 
POLYPEPTIDE 
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collagen polypeptides from the nucleotide sequence of the DNA. How- 
ever, one frequently encounters difficulties in isolation of full-length 
collagen cDNA sequences. For example, most cDNA clones specific for 
al(I) chains isolated in our laboratory (Ninomiya et al., 1984b), as well 
as those of others (Vuorio et al., 1982; Lukens et al., 1983; Deak et al., 
1985; Elima et al., 1985; Sangiorgi et al., 1985), cover only the 3’ 
untranslated region and the coding sequence for the carboxy] propep- 
tide. The reverse transcriptase sometimes seems to pause or stop at 
specific sites along the mRNA during synthesis of the first strand of 
cDNA (Ninomiya et al., 1984b). 

One way to circumvent this problem is to isolate genomic clones and 
deduce the structure of the protein by nucleotide sequence analysis of 
exons within such clones. Relatively short cDNAs can be used effec- 
tively to screen genomic libraries. In fact, the isolation of a series of 
overlapping clones of the chicken a2(I) collagen gene was accom- 
plished with a cDNA of only 180 bp in one laboratory (Ohkubo et al., 
1980). 

How can exons within a genomic clone be identified? When cDNAs 
are available, the problem is, of course, quite simple. Southern blot 
analysis of restriction endonuclease digests of cloned genomic DNA 
using the cDNA as probe will identify restriction fragments that con- 
tain coding sequences. One is frequently faced, however, with the 
problem of identifying exons within genomic DNA in regions where 
cDNAs are not available. Several alternative strategies are available, 
including R-loop mapping, S1 nuclease protection experiments, North- 
ern blot analysis, and computer analysis of nucleotide sequences. Elec- 
tron microscopic R-loop mapping has been extremely useful for locat- 
ing exons in some of the cloned fibrillar collagen genes (Vogeli et al., 
1981b; Ohkubo et al., 1980; Wozney et al., 1981; Yamada et al., 1983b; 
Chu et al., 1984). If highly enriched mRNA is available, this is clearly 
a fast and simple method of determining gene structures. However, if 
the RNA of interest only represents a very small fraction of the total 
mRNA, electron microscopy may not be possible. The alternative is 
then S1 nuclease protection and/or Northern blot analysis (see Mani- 
atis et al., 1982). 

These methods can provide a relatively accurate mapping of exons 
within genomic clones. However, determination of nucleotide se- 
quences is necessary to precisely define coding sequences and exon/in- 
tron boundaries. Once a sequence is determined, several computer 
methods (see Soll and Roberts, 1984) can be used both to locate likely 
coding regions within it and to define the position of splice junctions 
(Staden, 1985). In our own laboratory, a systematic search for in- 
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tron/exon splice junctions together with a search for sequences encod- 
ing collagenous domains have allowed definition of the exon structure 
of a large portion of the a2(IX) collagen gene (Lozano et al., 1985). 


G. Synthesis of Oligopeptides Based on Nucleotide Sequences 
and Generation of Antipeptide Antibodies 


Both monoclonal and polyclonal antibodies can be generated against 
synthetic oligopeptides coupled to haptens such as hemocyanine (Nim- 
an et al., 1983; Tamura et al., 1983). If the peptide sequence has a high 
hydrophilicity index (Hopp and Woods, 1981), it is likely to be located 
on the surface of the intact protein, and the anti-peptide antibodies 
are, therefore, likely to bind to the intact protein as well. 

We have used this strategy to generate antibodies against each of 
the three distinct polypeptide chains of chicken type IX collagen 
(Konomi et al., 1986), and the antibodies have allowed identification of 
these polypeptides after their electrophoretic separation on poly- 
acrylamide gels. As cDNA and genomic DNA clones for novel col- 
lagens are likely to be isolated before the protein products are identi- 
fied, it is possible that this approach will become more widely used. 


H. Nucleotide Sequence Analysis 


Determination of the amino acid sequence of a collagen polypeptide 
by molecular cloning requires the establishment of nucleotide se- 
quences of cDNAs and genomic DNAs. Nucleotide sequencing can easi- 
ly be accomplished by either the Maxam-Gilbert chemical method 
(Maxam and Gilbert, 1980) or by the dideoxy-chain termination method 
(Sanger et al., 1977). For large sequencing projects where several kilo- 
bases need to be sequenced, subcloning into a single-stranded phage 
vector and sequencing by the dideoxy method is undoubtedly the faster 
method (Messing, 1983; Barnes et al., 1983) and can be used either with 
a shotgun or sequential digestion approach. The shotgun strategy relies 
on fragmenting the DNA with restriction endonucleases (Messing, 
1983), DNase (Anderson, 1981), or sonication (Deininger, 1983). The 
fragments are cloned in M13 and sequenced. The order of the fragments 
is determined by finding overlaps in the sequences. In the sequential 
strategy, the DNA is shortened with an exonuclease such as Bal31 
(Poncz et al., 1982) or exonuclease III followed by S1 nuclease or ex- 
onuclease VII (Guo et al., 1983). We have had extensive experience with 
Bal31 and find it extremely useful for producing overlapping frag- 
ments, as long as M13 plaques for sequencing are picked on the basis of 
hybridization to the initial DNA (G. Lozano, D. McCormick and B. R. 
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Olsen, unpublished results). A drawback with the Bal31 method is the 
need for isolation of double-stranded cloned DNA and the recloning of 
DNA into M13 after the exonuclease digestion. This obstacle is circum- 
vented in a recently described method (Dale et al., 1985) in which a 
sequential series of overlapping clones are generated directly from 
single-stranded M13 clones. 

In our experience, however, not all collagen sequences can easily be 
determined by the dideoxy method, and one must, therefore, rely on 
the chemical degradation method as well. Also, for limited sequence 
analysis of smaller DNA fragments (<1000 nucleotides) the Maxam- 
Gilbert method is probably the method of choice. Several modifica- 
tions have recently been introduced in the chemical method, so that 
the sequencing reactions and the subsequent gel electrophoresis of the 
reaction products can be performed within a day (R. Tisser, personal 
communication). For both the dideoxy and the chemical methods, 
wedged sequencing gels that are thicker at the bottom than at the top 
are most useful. The compressed sequence ladder that forms at the 
bottom of wedged gels allows more nucleotides to be read than is the 
case with gels of even thickness. Also, gradient gels (Biggin et al., 
1983) can be used with similar results, but they are more difficult to 
pour than wedged gels. 

The use of the dideoxy method and [a-35S]dATP (Biggin et al., 1983), 
combined with drying of the sequence gels before autoradiography, 
routinely allows the reading of 300-350 nucleotides from a single gel. 
With multiple loadings of the gels and extended electrophoresis times, 
this number can be increased to 500 or more. 

Good computer programs are available for analysis and handling of 
nucleotide sequence data (see Soll and Roberts, 1984). Several pro- 
grams are available commercially. A sonic digitizer is extremely 
useful for entering sequence data directly from gels into a microcom- 
puter, thus eliminating the need for writing sequences down and en- 
tering them via the keyboard of the computer. 


IV. STRUCTURAL PROTEIN INFORMATION THAT CAN BE 
EASILY OBTAINED FROM CLONED DNA 


A. Amino Acid Sequences Derived 
from Nucleotide Sequences 


Obviously, the amino acid sequence of a polypeptide can be deter- 
mined by conceptually translating the nucleotide sequence of cloned 
DNA. Figures 5 and 6 illustrate the results obtained when translating 
the nucleotide sequence of the type X collagen gene. If the available 
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Fic. 5. See legend on p. 304. 
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GGC CCG CCT GGT GAG 
Gly Pro Pro Gly Glu 


GGA CAA AAA GGA GAA 
Gly Gln Lys Gly Glu 


AAC AGG GGT CTT CCA 
Asn Arg Gly Leu Pro 


TCT GCT GGG ATA GGG 
Ser Ala Gly Ile Gly 


CCA GGT ATG AAA GGT 
Pro Gly Met Lys Gly 


GCT GGT ATC CCA GGC 
Ala Gly Ile Pro Gly 


GTT GGC ATC GGC AAG 
Val Gly Ile Gly Lys 


GGC ATC CCC GGA GCC 
Gly Ile Pro Gly Ala 


GGA TCC CCA GGT CTT 
Gly Ser Pro Gly Leu 


ATG AAG GGA CAT AGA 
Met Lys Gly His Arg 


CCT AAA GGA GAC CAA 
Pro Lys Gly Asp Gln 


See legend on p. 304. 
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Gly Phe Gly 
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Fıc. 5. See legend on p. 304. 
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2207 


TGA CAT ACT CTC CTT CAG GAC ATT TTA ACT GGG CAC TGA TGT GAT 
aK 


681 


2252 
ATG TCC TTA GCA GGT CAT GTA ACT TCA ATT TTG CAA CAG AAA TCG 


2297 
AAA AGA GTG AAA AGG TAA AGA AAT TGG TTC AAA ACT AGT AGA AAA 


2342 
ATA TTG TAT GTT TTT AGA AAC TTG TCA TGG GAA ATT TGT TIT GAA 


2387 
CAA TGG TCA ATT GTC CAA CTG AAT GCA ACA TTA AAA TTT TAT GAA 


2432 } 
TTA ATA TAA TAC ACC TGT CCA TCT CTC TTC CTT AAA AAA AGG GGG 


2477 
GGG TGA AGG GGG GGG GGG AGG GGG GAG GGC AAA TCA GAT TTT CTA 


2522 
AGT ATC ATT TCT CAT TTA ACA GGT AAA GTA GCC ATT CCA AGA CTC 


2567 
TAG ACC ACA CTC GGG TAC ACT GAA TGA TAC CTT CAG TTG ATT CA 


Fic. 5. Nucleotide sequence of the type X collagen gene and the amino acid sequence 
of the conceptual translation product. Noncollagenous sequences are indicated by boxed 
areas. Short horizontal one-sided arrows indicate amino acid residues that were con- 
firmed by direct amino acid sequencing of two cyanogen bromide peptides of type X 
collagen. The two types of imperfections in the collagenous domain are underlined with 
single lines for the short type (Gly-X-Gly) and double lines for the longer type (Gly-X-Y 
X-Y-Gly). The two short vertical arrows indicate the 5’ and 3’ ends of the sequence 
contained within the cDNA pYN3116 [the poly(A) segment excluded]. Within the car- 
boxy-terminal noncollagenous domain, a relatively hydrophobic amino acid sequence of 
29 residues between 2 lysyl residues is indicated with a heavy line. Reproduced with 
permission from Ninomiya et al. (1986). 


DNA sequence covers the whole translated portion of the mRNA, the 
complete sequence of the translation product can be deduced quite 
easily. Residues that frequently are difficult to determine by amino 
acid sequencing of peptides, such as serine, threonine, asparagine, and 
glutamine, present no problem at the DNA level. However, since hy- 
droxyproline and hydroxylysine are not encoded by specific codons, 
these residues can only be determined by protein sequencing. For col- 
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=42<-—_____ 460AA ~— 170AA — 


NH2; 


: Met 


Fıc. 6. Diagram summarizing characteristic features of the type X collagen transla- 
tion product as deduced from the nucleotide sequence of the gene. If we assume that the 
methionyl codon at nucleotide 191 represents the start of translation, the polypeptide 
contains 42 amino acid residues of noncollagenous sequence at the amino end and 170 
residues of noncollagenous sequence at the carboxyl end. The collagenous domain is 460 
amino acid residues long. Short horizontal bars indicate the locations of short imperfec- 
tions (Gly-X-Gly); solid squares indicate the locations of long imperfections (Gly-X-Y-X- 
Y-Gly). The polypeptide sequence contains a large number of methionyl residues (open 
circles) and a concentration of tyrosyl residues (filled circles) within the carboxy-termi- 
nal noncollagenous domain. Cysteinyl residues are indicated by open squares. Re- 
produced with permission from Ninomiya et al. (1986). 


lagens, therefore, the substantial information introduced into polypep- 
tide chains by posttranslational modifications cannot be determined 
by nucleotide sequence analysis. 


B. Size of Primary Translation Products 


For proteins undergoing extensive posttranslational modifications, 
including proteolytic processing, primary structure determination by 
nucleotide sequencing enables one to determine the size of the primary 
translation products with relatively little effort. In addition, for col- 
lagens that frequently have been characterized as pepsin-resistant 
proteolytic fragments, it is, of course, extremely valuable to be able to 
define noncollagenous sequences without having to isolate intact poly- 
peptides from cells or tissues. A good example from our own laboratory 
is type IX collagen. Only after the isolation of a cDNA for the a1(X) 
collagen chain (Ninomiya and Olsen, 1984) did the structure of type IX 
collagen become clear. The sequence of the cDNA immediately sug- 
gested that the molecule contained three triple-helical domains sepa- 
rated by noncollagenous domains, and that the collagenous fragments 
HMW and LMW (see van der Rest and Mayne, this volume) were 
derived from the same molecule. 
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V. FURTHER Use oF DNA PROBES 


Although the isolation and characterization of cDNA and genomic 
DNA specific for various collagen types permits deduction of the pri- 
mary structure of collagenous polypeptides, the work on the molecular 
biology of collagens is primarily aimed at answering questions of reg- 
ulation of gene expression. Also, the analysis of collagen gene struc- 
ture will permit a genetic classification of collagens and will lead to 
elucidation of evolutionary relationships between collagen types. 

Finally, by modifying cloned genes in vitro and inserting them into 
cells and organisms, it should be possible to directly address the ques- 
tion of what functional roles the different collagen types play in extra- 
cellular matrices. 


VI. SUMMARY AND CONCLUSIONS 


The use of molecular cloning techniques has, in the past few years, 
led to rapid yet significant progress in the characterization of genetic 
types of collagen. Clearly, by their speed and simplicity, these tech- 
niques represent a superior method for determination of collagen pri- 
mary structure. When used in conjunction with protein and peptide 
chemistry, amino acid sequencing, peptide synthesis, and monoclonal 
antibody technology, molecular biology promises to provide definitive 
answers to questions of collagen structure and function both in normal 
extracellular matrix formation and in disease states. 
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Amino acid sequence 
of collagen type IV, 66-73 
of collagen type IX, 200 
of collagen type X, 299-304 
around cross-linking sites of collagen 
type I, 20 
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a chains, 114 
at site of amino propeptide cleavage, 
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Amino propeptides, see also specific col- 
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Anchoring fibrils, see Collagen type VII, 
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collagen type V in, 270 
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collagen type IX in, 195, 215-218 
collagen type X in, 223-257 
collagen type XI in, 195, 268-270 
cDNA 
cloning strategies, 283-291 
purification [poly(A)* RNA], 283-285 
libraries for collagen type IV, 59 
library from chondrocyte RNA, 285 
sequence analysis of, 298-299 
synthesis, 285-289 
Cell-free translation, 284 
Cells, interaction of 
with collagen type I, 25 
with collagen type IV, 65 
a-Chain composition, table of, for all 
collagen types, 2; see also specific 
collagen types 
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collagen type IX in, 201 
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collagen type X in, 223-257 
differentiation of, in cartilage zones, 
224 
Chondroitin sulfate, in collagen type IX, 
205-209 
CL glycoprotein, see Collagen type VI 
Cloning vectors, 290 
Collagenase 
cleavage site, interstitial collagens, 20 
collagen type I, 20-21 
TCA, 21 
TCB, 21 
collagen type IV specific, 64 
rate of cleavage of different collagen 
types, 21 
Collagen receptors on cells, 25 
Collagen type I 
binding to cells, 25 
a-chain composition, 9 
cross-links, 14-17, 19-20 
domain structure of, 13-14, 17-24 
electron microscopy, 4—7 
gene structure, 28—34 
primordial gene unit, 28-32 
signal peptide, 32 
triple-helical domain, 31-32 
glycosylation of hydroxylysine in, 12— 
13 
helix structure of, 4—8 
hydrophobic interaction within, 18— 
19 
quarter stagger alignment and, 7-8 
striations and, 7-8 
hydroxylation of proline and lysine in, 
11-13 
long spacing fibrils (FLS), 4 
molecular length 
amino propeptides, 21-22 
carboxy propeptides, 22-23 
67-nm periodicity, 8 
role of propeptides in control of fiber 
formation, 23-24 
sequence at site of amino propeptide 
cleavage, 24 
posttranslational processing, table of, 
10 
segment long spacing (SLS), 5 
telopeptides, 23 
Collagen type II 
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cell-surface receptor for, 25 
a-chain composition of, 10 
gene structure of, 28-34 
glycosylation of, 13 
propeptides in, 21—23 
Collagen type III 
a-chain composition, 10 
gene structure of, 28-34 
propeptides in, 21-23 
Collagen type IV 
amino acid sequence 
al(IV) chain, 66-71 
a2(IV) chain, 71-73 
biosynthesis, 59-61 
carbohydrate in, 54 
a-chain composition, 50-53, 57-58 
cross-links, 55-56 
degradation, 63-64 
disease, involvement of, 64-65 
Goodpasture syndrome, 64 
domain structure, 50-58 
NC1 domain, 56-57 
7 S domain, 44-45, 55-56 
disulfide bond stabilization of, 55-56 
extraction of, from EHS tumor, 50 
helical region, interruptions in, 52 
hydroxylation of proline, 3-hy- 
droxyproline, 54 
immunochemistry, 61-63 
interactions with 
cells, 65 
heparin sulfate proteoglycan, 65-66 
laminin, 65—66 
macromolecular assembly 
hexagon model, 48-49 
macromolecular structure, 44—49 
molecular length, 45 
network model, 48-49 
rotary-shadowed images, figures, 
44-47 
molecular biology of, 58-59 
molecular dimensions, 45 
pepsin fragments, 50-54 
segment long spacing crystallites, 44 
Collagen type V 
biosynthesis, 86-88 
carbohydrate, O-linked, 85 
a-chain composition, 81-83 
collagenase, resistance of collagen type 
V to, 95 
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degradation 
collagenase, resistance of collagen 
type V to, 94 
sensitive to gelatinase, 95 
distribution 
in cartilage, 270 
in extracellular matrix, 92-94, 95— 
97 
disulfide linkage between chains, 90 
domain structure of procollagen type 
V, 83-86 
functions of, 92—94 
interactions of 
with cells, 94 
with thrombospondin, 94 
molecular dimensions, 85 
precursor molecules, 84-86 
processing of, 88-92 
rotary-shadowed images, 84—85 
similarities with other fiber-forming 
collagens, 34-35 
sulfation, of tyrosine residues, 88 
Collagen type VI 
amino acid composition, 112 
oligomeric forms, 114, 117—119 
biosynthesis, 127-131 
carbohydrate, 117 
glycosylation of hydroxylysine, 117 
N-linked oligosaccharides, 117 
reaction with lectins, 117 
CD spectrum of, 114 
a-chain composition, 111, 113 
as component of microfibrils, 122-127 
cysteine residues, disulfide bonds, 116, 
129 
degradation, resistant to bacterial, 
collagenase, 116-117 
distribution 
in cartilage, 270 
in extracellular matrix, 105-107, 
122-127, 135-139 
perpendicular to collagen and elastin 
fibers, 137 
domain structure, 114-115 
elastin fibers, lack of association with, 
137 
in fibrotic diseases, 140 
functions of, 139-140 
GP-140, GP-250, 106, 133-135 
hydroxylation, proline and lysine, 117 


313 


immunology, 131-135 
macromolecular structure 
electron microscopy, 108-110 
microfibrillar assembly, 119-127 
rotary-shadowed images, figures, 
109-110, 120 
MFP I, 106, 133, 139 
models for aggregation, 121 
optical rotation of, 114 
purification of, 107-108 
SC1, SC2, SC3, 113 
Collagen type VII 
amino acid composition, table, 150 
anchoring fibrils, 163-172 
anchoring plaques, 166—170 
network model, figure, 167 
in basement membrane zone, 162-172 
biosynthesis, 155-162 
a-chain composition, 148-151 
collagenase, helical region insensitive 
to mammalian collagenase, 155 
cyanogen bromide peptides from, 154 
cysteine residues, effects of disulfide 
bond reduction on gel mobility, 
149 
diseases of, 171 
distribution 
in extracellular matrix, 162—170 
table, 163 
domain structure, 155-161 
NC1, 159-160 
NC2, 160-161 
function, 170-171 
helix-to-coil transition, 153 
immunology, antigenic epitopes on, 
155-156 
interchain disulfide bonds, 154 
isolation from skin and amnion, 146 
macromolecular structure, 148-151 
disulfide-linked dimer, 151 
electron microscopy, rotary shadow- 
ing, 152, 160 
model of molecular assembly, 160 
molecular dimensions, table, 153 
pepsin products, P1 and P2, 150 
processing of procollagen type VII, 
161-162 
rotary-shadowed images, 152, 160 
segment long spacing (SLS) 
crystallites, 162—163 
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Collagen type VIII 
biosynthesis, secretion of, by cultured 
cells, 175-179 
a-chain structure, 190-191 
collagenase, degraded by interstitial, 
183 
cyanogen bromide peptides, 182 
Descemet’s membrane, 178, 188-190 
distribution in extracellular matrix, 
188-189 
EC1, EC2, EC3, 175-179, 183 
hydroxylation of, 179 
3-hydroxyproline, 181 
secreted in absence of, 187 
interactions of, with, 183 
molecular structure 
cassette model, 190-191 
triple-helical domains, 191-192 
pepsin fragments, 184 
propeptides, lacking in, 183 
relationship to other collagen types, 
192 
Collagen type IX 
biosynthesis, 215, 218 
synthesis of, relative to collagen 
type II, 219 
carbohydrate in 
mannose, 209 
sulfated glycosaminoglycans, 202— 
203, 205-209 
a-chain structure, 196 
cysteine residues, disulfide bonds in, 
213-214 
distribution 
codistributed with collagen type II, 
216-217 
in extracellular matrix, 215-218 
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LMW (low molecular weight), 198-205 
amino-terminal sequence of, 200 

macromolecular structure 
conformational stability of, 212-213 
electron microscopy, rotary-shadow- 

ing images, 197, 203 

models of, 196, 204 

molecular biology of, 203-205, 218 

pepsin fragments, 199 

processing, 201 

as proteoglycan, 205-209 

relation to other collagens, 218 


Collagen type X 


visualized after hyaluronidase diges- 


tion, 216-217 
domain structure 
collagenous domains (COL1-COL3) 
196, 210-213 


3 


noncollagenous domains (NC1-NC4), 


196, 210-213 
HMW (high molecular weight), 198— 
205 
hydroxylation of proline in, 204, 211, 
212 


hydroxylysine in, glycosylation of, 211 


immunology of, 203 


amino acid 
composition, 230-232 
sequence, 299-304 
biosynthesis of, 225-226, 242-246 
carbohydrate in, glycosylation of hy- 
droxylysine, 232 
collagenase, susceptibility to, 229, 
236-240 
cyanogen bromide peptides, 232-234 
cysteine, disulfide bonds, 232 
distribution in extracellular matrix, 
246-254 
domain structure 
collagenous domain, 240—241 
noncollagenous domains (NC1, NC2), 
227, 241 
functions of, 254-255 
hydroxyproline, 231 
isolation of, from culture medium, 
226-229 
matrix vesicles, 255 
molecular biology, gene of, no introns, 
223 
molecular structure 
conformational stability of, 223, 231, 
234-236 
electron microscopy, rotary shadow- 
ing, 228 
model of molecular structure, 305 
nucleotide sequence of, 299-304 
pepsin fragments, 229 
segment long spacing crystallites 
(SLS), 241 
size, relative to other collagen types, 
223 


Collagen type XI or la2a3a 


amino acid composition, 270-271 


INDEX 


biosynthesis, 274—275 
a-chain structure, heterotrimer, 270— 
273 
cross-linking, 273-274 
cyanogen bromide peptides, 264—266 
degradation of, 275-277 
diseases and, 278 
distribution 
distributed with collagen type II, 268 
in extracellular matrix, 265-270 
table, 267 
functions, 278-279 
hydroxylysine 
enrichment in, 271 
glycosylation of, 271 
immunology of, 277-278 
isolation of, 262-263 
macromolecular structure, group 1 
fibrils formed by, 273 
pepsin fragments, 265 
proteases, susceptibility to, 275-277 
with proteoglycans, interactions of, 
279 
shared properties with collagen type V, 
261, 270 
Collagen type K, see Collagen type XI 
Collagen types, table, 2 
Cosmid vectors, 290-291 
Cross-links, 14-17 
catalyzed by lysyl oxidase, 16 
in collagen type IX, 214 
formation, figure, 16 
lathyrism, 14 
in 7 S domain of collagen type IV, 56 
Cutis laxa, collagen type VI in, 140 
Cysteine, intermolecular disulfide bridge 
in collagen type IV, 55 


D 


Degradation, see also specific collagen 
types 
3,4-Dehydroproline, 12—13 
Dermatosparaxsis, 13-14 
amino extension peptide not removed, 
13 
missing amino-terminal procollagen 
peptidase, 13 


315 


Descemet’s membrane, collagen type VIII 
in, 178, 188—190 

Dideoxy nucleotide sequencing, 298-299 

Drosophila melanogaster, collagen genes 
in, 291 


E 


EC (endothelial collagen), see Collagen 
type VII, 173 

EHS tumor, extraction of collagen type 
IV from, 50 

Elastase, degradation of collagen type 
IV, 63 

Elastin fibers, lack of association of col- 
lagen type VI with, 137 

Electron microscopy, collagen type I, 4— 
7; see also Rotary-shadowed, images of 
of 

Endochondral ossification, 223 

Endothelial cells, collagen type VIII in, 
173 

Epidermolysis bullosa, 171 

Exons, identification in genomic DNA, 
295-298 

Expression libraries 

Agtll, 296 
screening with antibodies, 293-295 


F 


Fibronectin, lack of binding to collagen 
type VIII, 181-182 


G 


p-Galactose, linked to hydroxylysine, 12 
Galactosyltransferase, 12 
Gelatinase, degradation of collagen type 
V, 95 
Gene structure 
collagens type I, II, and III, 28-34 
evolution of collagen gene, 36 
Genomic DNA 
clones for mouse collagen type IV, 58 
identification of exons in, 295-298 
sequence analysis of, 298-299 
Genomic libraries 
library construction, 290-291 
purification of DNA, 291 


316 


2-O-a-D-glucopyranosyl-O-ß-p-galac- 
topyranosylhydroxylysine, 12 
D-Glucose, linked to hydroxylysine, 12 
Glucosyltransferase, 12 
Glycosaminoglycans, sulfated forms in 
collagen type IX, 205 
Glycosylation, of hydroxylysine 
in collagen type I, 12-13 
in collagen type II, 13 
in collagen type VI, 117 
in collagen type XI, 270-271 
Goodpasture syndrome and collagen type 
IV, 64 
Agtl1 expression library, illustration of 
use in cloning, 296 


H 


Hemidesmosomes, 170 
Heparin sulfate, interaction with col- 
lagen type IV, 65-66 
Hexagon model of collagen type IV as- 
sembly, 48-49 
HMW, see Collagen type IX 
Hyaluronidase, unmasking of antigenic 
epitopes of collagen type IX in car- 
tilage, 216-217 
Hybrid-select translation, 295 
Hydroxylation of lysine 
by lysyl hydroxylase, cofactors, 11 
number of residues hydroxylated vari- 
able between collagens, 13 
sequence specificity, 11 
Hydroxylation of proline 
by proline 3-hydroxylase, 11-12 
by proline 4-hydroxylase, 11 
reaction cofactors, 11 
sequence specificity, 11-12 
Hydroxylysine, sites for glycosylation, 12 
in collagen types I, II, and III, 11-13 
in collagen type VI, 117 
Hydroxylysinonorleucine cross-link in 
collagen type I, 15 
Hydroxyproline 
3-hydroxyproline 
in collagen type IV, 54 
in collagen type VIII, 181 
4-hydroxyproline 
in collagen type I, 11-13 
role in helix stabilization, 12 
Hypertrophic cells, collagen type X in, 
224 
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I 
Iron, cofactor for hydroxylation, 11 
L 


Lamina densa, relationship to collagen 
type VII, 167-170; see also Base- 
ment membrane 

Laminin, interaction with collagen type 
IV, 65-66 

Langer-Saldino dwarfing syndrome, 278 

Lathyrism, 14 

Lectins 

collagen type VIII does not bind, 183 
interaction of collagen type VI with, 
117 

Libraries, see Genomic libraries or ex- 
pression libraries 

LMW, see Collagen type IX 

Low-molecular-weight collagen, see Col- 
lagen type X 

Lysyl oxidase, 16 


M 


M1, M2, see Collagen type IX 

M13, 298-299 

Mast cell protease, 183 

Matrix vesicles, and collagen type X, 255 

Maxam-Gilbert nucleotide sequencing, 
298-299 

Microfibrils, 105-107, 122-127 

in different tissues, table, 126 

mRNA, purification [poly(A)' RNA], 

283-285 


N 


Network model of collagen type IV as- 
sembly, 49 


(0) 
2-Oxoglutarate, cofactor for hydroxyl- 
ation, 11 
P 
Pepsin 


fragments of collagen type IV, 51 
fragments of collagen type VII, 158 
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fragments of collagen type VIII, 184 
fragments of collagen type IX, 199 
fragments of collagen type X, 229 
fragments of collagen type XI, 265 
solubilization of collagen type VI, 106 
treatment of collagen type I removes 
telopeptides, 26 
Phenylboronate-agarose, purification of 
collagen type VI, 107 
Platelets, interaction with collagen type 
I, 26 
Probes 
cDNA and genomic DNA, 291-292 
cross-hybridizing, 291 
restriction endonuclease-generated, 
292-293 
synthesis of oligonucleotide, based on 
nucleotide sequence, 298 
Procollagens, see also specific collagen 
types 
Proteoglycan 
binding of collagen type XI to, 279 
collagen type IX as, 205-209 
PG-Lt, see Collagen type IX 


R 


Restriction endonucleases, 289, 292-294 

Rotary-shadowed, images of 
collagen type IV, figures, 46-47 
collagen type V, description, 84-85 
collagen type VI, figures, 108-110, 120 
collagen type VII, figures, 152, 160 
collagen type IX, figures, 197, 203 
collagen type X, figures, 228 
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S 


Sau96l restriction endonuclease, for 
screening collagen. cDNAs, 289, 
294 
SC1, SC2, SC3, see Collagen type VI 
7 S domain of collagen type IV, 44—45, 
55-56 
Segment long spacing (SLS) crystallites 
collagen type I, 5-6 
collagen type IV, 44 
collagen type VII, 162-163 
collagen type IX, 198 
collagen type X, 241 
Sequence analysis, nucleotide, 298-299 
Southern blot analysis, 297 
Sulfation 
of collagen type IX, 202 
of tyrosine residues in collagen type V, 
88 


T 


TCA, TCB, collagenase cleavage sites in 
collagen type I, 21 
Telopeptides, in collagen type I, 23 
Translation 
cell-free, 284 
hybrid-select, 295 
Triple-helical domain 
gene structure in interstitial collagens, 
28—34 
interstitial collagens, 18 
of type VII collagen, 148 


